The Si-SiO2 Interface - Electrical Properties
as Determined by the Metal-Insulator-Silicon
Conductance Technique


By E. H. NICOLLIAN and A. GOETZBERGER 
(Manuscript received December 28, 1966) 


Measurements of the equivalent parallel conductance of metal-insulator- 
semiconductor (MIS) capacitors are shown to give more detailed and 
accurate information about interface states than capacitance measurements. 
Experimental techniques and methods of analysis are described. From the 
results of the conductance technique, a realistic characterization of the 
Si-SiO2 interface is developed. Salient features are: A continuum of states
is found across the band gap of the silicon. Capture cross sections for holes 
and electrons are independent of energy over large portions of the band gap. 
The surface potential is subject to statistical fluctuations arising from 
various sources. The dominant contribution in the samples measured arises 
from a random distribution of surface charge. The fluctuating surface 
potential causes a dispersion of interface state time constants in the deple- 
tion region. In the weak inversion region the dispersion is eliminated by 
interaction between interface states and the minority carrier band. A single 
time constant results. From the experimentally established facts, equivalent 
circuits accurately describing the measurements are constructed. 


I. INTRODUCTION 


The electrical properties of semiconductor surfaces have been 
studied for a long time. Until recently most investigators were con- 
cerned with either etched surfaces covered by a thin natural oxide 
layer or with ultraclean surfaces that exist only in high vacuum.
Development of improved oxidation techniques for silicon has now made
it possible to study the electrical phenomena occurring at the
interface between silicon and silicon dioxide. These studies resulted in 
the recognition that a semiconductor-insulator interface behaves dif- 
ferently in many ways from a “bare” surface. Of practical importance 
is the fact that an oxide-covered surface is more stable and can be 
made electrically more perfect than an unprotected surface. 

The differences between oxidized and bare surfaces are listed briefly
here.

(i) There is no charge exchange with states at the air-oxide inter-
face. This type of state, the “slow surface state,” has a long time
constant and is encountered on etched surfaces. For thick oxide layers,
only states at the oxide-semiconductor interface have to be considered.
They are equivalent to “fast states” and will be called interface states
in this paper. In addition to interface states, there may be traps within
the oxide.1,2 Not much is known about these traps and they will not
be a major topic of this paper.

(ii) All oxidized silicon surfaces contain a so-called surface charge.
It consists of positive charges residing close to the interface. The
surface charge originates from two sources, alkali ions3 and built-in
charges.4,5 Alkali ions can drift through the oxide at fairly low tem-
peratures and thus be distinguished from built-in or residual charges
which are fixed and have a total density which depends on oxidation 
rate and crystal orientation.* The present paper deals with interface 
states in alkali-free systems. Reference to residual surface charge will
be necessary in connection with its influence on the uniformity of
surface potential.

(iii) Thermally oxidized Si-SiO2 interfaces are characterized by a
much lower density of states than “bare” surfaces. The density of
states over most of the silicon band gap in the oxidized systems is
usually not much greater than 10^12 cm^-2 eV^-1 or smaller than
10^10 cm^-2 eV^-1, depending on the method of oxide preparation. For
“bare” surfaces, density of states can be as high as 10^15 cm^-2 eV^-1.

The electrical properties of interface states are characterized by
their density, their position in the energy gap of the silicon, and their 
capture cross section. In addition, it is necessary to know whether they 
are of the donor or acceptor type. The most widely used tool for 
investigating these interface state properties is the metal-insulator- 
semiconductor (MIS) capacitor. 

Dispersion of the capacitance can be used to obtain information 
about the energy distribution and density of interface states as has
been shown by Terman.7 The capacitance technique, however, has
severe limitations.8 Essentially, the difficulty is that interface state
capacitance must be extracted from measured capacitance which con-
sists of oxide capacitance, depletion layer capacitance, and interface
state capacitance. This difficulty does not apply to the equivalent
parallel conductance because conductance arises solely from the
steady-state loss due to the capture and emission of carriers by inter-
face states and is thus a more direct measure of these properties.9
Conductance measurements yield more accurate and reliable results, 
particularly when the density of interface states is low as in the 
thermally oxidized system because only directly measured quantities 
are used with no matching to a theoretical curve required. Both the 
capacitance and equivalent parallel conductance as functions of 
voltage and frequency contain identical information about interface 
states. Greater inaccuracies arise in extracting this information from 
the capacitance. This is illustrated in Fig. 1 which shows capacitance
and equivalent parallel conductance measured at 5 kHz and 100 kHz.
The largest capacitance spread is 14 percent while the magnitude of
the conductance peak increases by over one order of magnitude in this
frequency range.

Fig. 1 - Capacitance and equivalent parallel conductance measured at 5 kHz
and 100 kHz on a p-type sample having an acceptor density of 2.08 × 10^16 cm^-3
and interface state density in the 10^11 cm^-2 eV^-1 range. Interface state time con-
stants and densities are given in Fig. 14 and Fig. 15 for this sample.

In order to evaluate MIS measurements with regard to interface 
states, it is desirable to have one-dimensional current flow perpendicu- 
lar to the interface. This condition does not apply when the silicon 
underneath the field plate and beyond is inverted. Then, the capaci- 
tance and conductance are dominated by the lateral ac current flow 
that has been explained elsewhere.10,11 Because of the positive polarity
of surface charge, p-type silicon is normally inverted and is therefore 
most easily investigated in the depletion-accumulation region. 

In the MIS capacitor, the density of states is normally low so that 
the largest silicon band bending possible is not limited by their 
influence. Maximum possible band bending in the MIS capacitor is 
slightly less than the band gap. In this paper, interface state properties
are described for energies in the forbidden gap within the range 2u_B
[see Fig. 5(a)] only at room temperature.

Lehovec and Slobodskoy12 theoretically treat losses due to steady-
state capture and emission of both minority* and majority* carriers
by a hypothetical single level interface state for the cases of zero and
infinite recombination-generation rate through states in the silicon
space-charge region. No direct experimental measurement and evalua-
tion of these losses had been made until recently.9 The theory and
equivalent circuits developed in Refs. 12 and 13 constitute a very gen- 
eral description of all the losses and charge storage processes possible 
in an MIS structure. The gap between this general theory and what is 
actually measured in a sodium-free MIS structure is bridged in this 
work. It can be shown by conductance measurements that: 

(i) loss in the oxide measured at room temperature is negligible,

(ii) in the samples studied, the dominant loss is by capture and
emission of carriers by interface states rather than states in the silicon
space-charge region, and

(iii) the majority carrier time constant is always the dominant one.

These three results make it possible to study the capture and emission
properties of the interface states for each type of carrier independently.
It is this fact which makes the MIS capacitor a powerful tool.

* The term minority carrier denotes those carriers which are in a minority in
the bulk of the semiconductor crystal. The term majority carrier denotes those
in a majority in the bulk of the crystal.

We shall divide the range of surface potential from accumulation to
inversion into four regions, each of which is characterized by a different
equivalent circuit. These regions are: depletion-accumulation, mid-gap,
weak inversion, and strong inversion. The depletion-accumulation
region includes those values of surface potential less than u_B, where u_B
is shown in Fig. 5(a). Mid-gap is the region a few kT/q volts wide
about u_B. Weak inversion is between u_B and 2u_B. Strong inversion
includes those values of surface potential greater than 2u_B. The follow-
ing results have been obtained using the MIS conductance technique.

(i) A continuum of states across the bandgap of the silicon appears
to be characteristic of the Si-SiO2 interface.

(ii) Capture cross sections for holes and electrons are independent of
energy over large portions of the bandgap.

(iii) Time constant dispersion at each bias in the depletion-accumu-
lation region is dominated by statistical fluctuations of surface poten-
tial. In the samples measured, these fluctuations are caused primarily
by a random distribution of built-in residual oxide charges and charged
interface states over the plane of the interface.

(iv) A single time constant at each bias is observed in the region
of weak inversion because the minority carrier capture resistance be-
comes negligibly small and the inversion layer provides a large lateral
conductance across the silicon surface.

(v) Equivalent circuits are derived for the depletion-accumulation,
mid-gap, and weak inversion regions. (See Fig. 31.) The equivalent
circuit which applies in strong inversion has been worked out pre-
viously.11,14 This circuit was proposed in Ref. 11 for the case where
lateral ac current flow is absent and has been verified by experiment
using n-type samples.14


II. SAMPLE PREPARATION 


The sample preparation processing described in this section is in- 
tended to yield oxides which remain stable only during admittance- 
voltage measurements made at room temperature or lower. The sta- 
bility requirements for device applications would be more stringent 
than this and would require somewhat different processing. No attempt 
is made to minimize built-in charge density. Both high and low inter- 
face state densities are purposely produced. 

Measurements were made on both boron and arsenic doped silicon 
oriented in the [111] and [100] directions. Samples were prepared by 
first growing a 10-μ thick epitaxial layer having a nominal resistivity
of 1 ohm-cm on a 0.005 ohm-cm substrate. The substrate always in-
corporates the same dopant as the epitaxial layer. The purpose of
using an epitaxial layer is to minimize bulk series resistance to the 
point where it can be neglected—an important consideration when 
making conductance measurements. 

The slices, about 1 cm square, were removed from the epitaxial
grower and immediately placed in the oxidation apparatus described
in Ref. 4. An oxide typically between 500 Å and 700 Å thick was then
grown at 1000°C in steam at atmospheric pressure by the bias tech- 
nique of Ref. 4. 

Another group of samples was prepared by growing about the same 
thickness of oxide the same way except in dry oxygen at 1000°C 
and atmospheric pressure. Trace quantities of H2 were removed from
the O2 stream by first passing it through a Deoxo* unit. The O2 was
then dried by passing it through a dry ice-acetone trap. The oxide
should be as thin as possible without being in the tunnelling range
to make the measured admittance correspond more closely to the
admittance of the interface. Oxidizing the sample immediately after
the epitaxial layer is grown ensures that the silicon surface has not
had much opportunity to gather contamination from its surroundings.
The oxide was grown by the bias technique to obtain an oxide in which 
ionic contamination is a minimum. The reason for taking pains to 
minimize ionic contamination in the oxide is that the presence of an 
ion near an interface state may perturb its energy and capture proba- 
bility although such ions do not themselves act as interface states.’ 

Immediately after oxidation, the slice was placed in an oil diffusion 
pump evaporator and a number of chrome-gold field plates were 
evaporated through a metal mask held in intimate contact with the 
oxide surface. First, a few hundred Å of chromium was deposited to
make a strong bond to the oxide. This was followed by 2000 Å of
gold to provide good electrical contact. 

The chromium and gold sources were heated to evaporation tem- 
perature in adjacent tungsten baskets. The baskets are 15 cm above 
the surface of the sample to reduce penumbra effects.§ Since not only 
capacitance but also equivalent parallel conductance depends on the 
area of the field plate, a large field-plate diameter is used to get a 
large admittance thereby increasing detection sensitivity. However, 
making the field-plate diameter large increases the likelihood of 
encompassing defects and gross nonuniformities under it. A satisfac-
tory diameter with these considerations in mind can be as large as
1500 μ. The oxide on the substrate side was then etched off in HF.
Cr-Au in the same proportions as in the field plate was evaporated all
over this side to make the back contact. The slice was placed in good
thermal contact with a massive copper block during all the evapora-
tion steps to keep it always at room temperature. This minimizes
sodium migration into the oxide during evaporation. Minimizing so-
dium contamination in this step of the processing also reduces the
likelihood of a patchy charge distribution underneath the field plate.

* This unit contains a catalyst which enables H2 and O2 to react at room
temperature producing H2O.


III. ORIGIN OF THE EQUIVALENT PARALLEL CONDUCTANCE 


Two experiments are described which show that the dominant 
process causing the measured equivalent parallel conductance is 
capture and emission of carriers by interface states. Series resistance 
is not important because it can be made negligible by using epitaxial 
samples or it can be measured in strong accumulation and subtracted
from the total impedance. Normally, small-signal measurements are 
made so that harmonics of the signal frequency arising from the non- 
linearity of the charge-voltage characteristic are unimportant (see 
Section 5.1). 
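
The series-resistance correction mentioned above can be sketched in a few lines. This is only an illustration under stated assumptions, not the procedure used for the measurements reported here: the admittance is taken to be available as a complex number, the series resistance is read from the real part of the impedance in strong accumulation, and both function names are hypothetical.

```python
def series_resistance_from_accumulation(y_accumulation: complex) -> float:
    """Estimate series resistance (ohms) from the admittance in strong accumulation.

    Assumption: in strong accumulation the capacitor behaves as the oxide
    capacitance in series with the bulk resistance, so the real part of the
    impedance is the series resistance.
    """
    return (1.0 / y_accumulation).real


def remove_series_resistance(y_measured: complex, r_series: float) -> complex:
    """Subtract a known series resistance from the total impedance.

    The measured admittance is converted to an impedance, the series
    resistance is subtracted, and the result is converted back.
    """
    z_corrected = 1.0 / y_measured - r_series
    return 1.0 / z_corrected
```

With epitaxial samples the correction is expected to be negligible, so a sketch like this would matter mainly for samples with appreciable bulk resistance.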


3.1 Loss in the Oxide 

The first experiment shows that equivalent parallel conductance 
measured at room temperature does not arise from loss in the oxide. 
Capacitance and equivalent parallel conductance for a p-type sample 
are plotted as functions of field plate bias in Fig. 2 before and after 
aging an unstable oxide in room air at 150°C. This aging produces an 
increased positive charge density in the oxide as seen by the shift of 
the capacitance curve to higher negative bias. This means that the 
electric field in the oxide resulting from the applied bias is greater for 
a given silicon surface potential after aging. However, the conductance 
curve has not changed appreciably in shape or magnitude. The con- 
ductance curve is shifted to higher negative bias by the same amount 
as the capacitance curve. This is seen by the fact that the peak of the 
conductance occurs at the same value of capacitance before and after 
aging. Therefore, the conductance is a function of silicon surface poten-
tial and not of field in the oxide, and hence is not related to loss in
the oxide.


3.2 Loss in the Silicon Space-Charge Region

The most important processes which can give rise to the measured
conductance are capture and emission of carriers by interface states or
by states in the silicon space-charge region. It is interesting to note
that the MIS capacitor can be used to measure the properties of either
type of state provided one or the other dominates the loss processes.*
Processing determines the densities and positions in the bandgap of
levels of each type of state. In all samples made as described in Sec-
tion II, interface state loss is dominant over the entire measurable
range. In all these samples, no impurities other than arsenic or boron
are purposely introduced into the silicon. The densities and energy
levels of other impurities or defects inadvertently introduced into the
silicon during fabrication of the MIS capacitor are unknown. The
same is true for interface states. It is best, therefore, to establish by
experiment which effect dominates. This is described next.

Fig. 2 - Measured capacitance and equivalent parallel conductance as func-
tions of field plate bias before and after aging at 150°C in room air. Field plate
diameter is 3.8 × 10^-2 cm, silicon resistivity 50 Ω-cm, p-type, crystal orientation
[111], oxide thickness 1200 Å, and measurement frequency 100 kHz.

* Sah15 has used the MIS capacitor to study states introduced into the silicon
by heavy gold doping.

The magnitude of the equivalent parallel conductance measured is
always much smaller when the oxide is grown in steam than when the
oxide is grown in dry oxygen, other parameters being the same.* In
addition, the oxide can be reversibly cycled from the wet to the dry
condition by heat treatment between 200°C and 400°C in various
ambients. To enable a quantitative comparison, it is necessary that
the resistivity of the silicon, oxide thickness, and field plate area be the
same for both conditions. The easiest way to accomplish this is to
cycle the same capacitor from the wet to a drier condition. Curves
(a) and (b) in Fig. 3 are plots of capacitance and equivalent parallel
conductance for a steam-grown oxide on p-type silicon. Curves (c) and
(d) in this figure are measured after heating the capacitor in dry
nitrogen (dried by passing through a liquid nitrogen trap) for 17 hours
at 350°C. A similar set of curves for n-type is given in Fig. 4. All the
curves in these two figures are measured at 100 kHz. Curves similar
to these can be obtained at all frequencies in the range investigated
(50 Hz to 500 kHz). It is important to note only that the capacitance
in Fig. 3 from curve (c) associated with the conductance peak of curve
(d) is nearly the same as the capacitance from curve (a) associated
with the conductance peak of curve (b). The same is true in Fig. 4.
Therefore, all the relevant parameters before and after drying are the
same except the magnitudes of the conductance peaks. The peak con-
ductance of curve (b) in Fig. 3 is about one-third the peak conductance
of curve (d). This result cannot be due to sodium contamination
which might have been introduced into the oxide during the drying
cycle, the initial wet oxide being sodium free (see Section II). The
reasons for this are: (i) Sodium at the interface always remains
charged in the range between accumulation and strong inversion?
Therefore, it does not become an interface state which could influence
the conductance. (ii) A nonuniform charge distribution in the oxide
caused by the sodium would spread the conductance curve over a
larger range of bias which would decrease rather than increase the
magnitude of the conductance peak if this effect were dominant.
(iii) The p-type curves in Fig. 3 are shifted towards more negative
bias while the n-type curves in Fig. 4 are shifted towards less negative
bias with drying. Introduction of sodium during drying would shift
both curves towards more negative bias. The increase of the conduct-
ance peak in both p and n-type is then a true drying effect.

Fig. 3 - (a) capacitance and (b) equivalent parallel conductance vs field plate
bias before drying; (c) capacitance and (d) equivalent parallel conductance vs
field plate bias of the same capacitor after heating in dry nitrogen for 17 hours
at 350°C. Field plate diameter is 3.8 × 10^-2 cm, silicon resistivity 1 Ω-cm, p-type,
crystal orientation [111], oxide thickness 910 Å grown in steam, and measure-
ment frequency 100 kHz.

Fig. 4 - (a) capacitance and (b) equivalent parallel conductance measured vs
field plate bias before drying; (c) capacitance and (d) equivalent parallel con-
ductance vs field plate bias of the same capacitor after heating in dry nitrogen for
17 hours at 350°C. Field plate diameter is 3.8 × 10^-2 cm, silicon resistivity 0.75 Ω-cm,
n-type, crystal orientation [111], oxide thickness 600 Å grown in steam, and
measurement frequency 100 kHz.

The fact that the oxide can be reversibly cycled from the wet to 
the dry state at low temperatures rules out the possibility that a large 
density of structural defects or impurities which act as bulk states is 
introduced into the silicon during one method of oxide preparation and 
not the other. All of this strongly suggests that the measured equiv- 
alent parallel conductance is due mainly to losses in states which 
reside on the oxide side of the interface and exchange charge at 50 Hz 
to 500 kHz with the silicon because only the properties of such states 
could be so markedly altered by low temperature wet-dry cycling. 

The most convincing evidence, however, is the dependence of inter- 
face state time constants on surface potential to be discussed later in 
this paper. 


3.3 Loss Due to Minority Carrier Transitions 

Minority carrier density can respond to the applied signal frequency 
by diffusion from the bulk to the interface and by recombination- 
generation processes through interface states and states in the silicon 
space charge region. These processes in the samples measured are too 
slow to allow the minority carrier density to follow 50 Hz to 500 
kHz.11,14 It was indicated in the preceding part of this section that in
this frequency range, interface state loss is solely responsible for the
measured equivalent parallel conductance. In the depletion region
where minority carrier density is orders of magnitude less than 
majority carrier density, loss due to minority carrier transitions will be 
negligible compared to loss due to majority carrier transitions. Thus, in 
this region, the conductance measured arises simply from capture and 
emission of majority carriers by those interface states within a few 
kT/q of the Fermi level. In the region of weak inversion where 
minority carrier density is orders of magnitude greater than majority 
carrier density, minority carrier time constant becomes shorter than 
majority carrier time constant. Minority carriers will follow the signal 
frequency. However, current can flow and produce a loss only by 
transitions through the interface states from the minority carrier band 
to the majority carrier band. Therefore, this process will be controlled 
by the majority carrier time constant. This case will be treated in 
detail in Section IV. The important point is that the loss is determined 
entirely by the majority carrier time constant in both the depletion 
and weak inversion regions. 


IV. THEORY 


4.1 Phenomenological Description of Loss 

A continuum of interface states over the bandgap of the silicon 
appears to be characteristic of the Si-SiO2 interface. Fig. 5(a) is a
schematic of an MIS capacitor showing the band bending in depletion 
for n-type silicon with an arbitrary distribution of interface states in 
the silicon bandgap. Initially, let the interface states be in equilibrium 
with the silicon. Then let a small ac signal between 50 Hz and 500 kHz 
be applied. Consider the first half of the cycle which moves the con- 
duction band towards the Fermi level. This immediately increases the 
average energy of the electrons in the silicon. Because a conductance is 
observed, it is evident that the interface states do not respond im- 
mediately but lag behind. Therefore, electrons at a higher average 
energy in the silicon will be captured by interface states at a lower 
average energy. This results in an energy loss. On the other half of 
the cycle, the signal moves the valence band towards the Fermi level. 
Electrons in filled interface states now will be at a higher average 
energy than electrons in the silicon. As the electrons are emitted by the 
interface states into the silicon they will lose energy again. Thus, there 
will be an energy loss on both halves of the cycle which must be sup-
plied by the signal source. The energy required for transitions between
the band edge and an interface state is much higher than this and is
supplied by phonons. Only majority carrier capture and emission is
considered here because this is the dominant loss mechanism for
reasons given in Section III.

Fig. 5 - (a) Schematic showing band bending in depletion at silicon surface in
an n-type MIS capacitor. An arbitrary distribution of interface states is shown
for illustrative purposes. Majority carrier transitions to two levels within kT/q
of the Fermi level are also shown. f_0 is the Fermi function. (b) Band bending
in depletion showing point at which Fermi level crosses a shallow impurity level.
This impurity level is hypothetical. W is space charge width. (c) Band bending
in weak inversion showing point at which Fermi level crosses a hypothetical im-
purity level located at mid-gap.

After a disturbance has been applied, the interface states will reach
equilibrium with the silicon exponentially with a characteristic time,
τ. The interface states are also capable of storing charge. Thus, there
will be a capacitance, C_s, associated with them which is proportional to
their density. The conductance related to the loss is G_s = C_s/τ. It
should be noted that G_s is independent of signal amplitude as long as
C_s and τ are independent of it. This will be the case for small-signal
amplitude. Interface state τ and C_s can then be extracted directly from
a measurement of equivalent parallel conductance. Capture cross
section is extracted from τ and density of states from C_s as will be
shown in Section VII.


4.2 Admittance of a Single Level State

Our purpose is to derive expressions for the admittance of the MIS 
capacitor as a function of bias and frequency, and to obtain the cor- 
responding equivalent circuits which fit in detail to experimental obser- 
vations. The capacitance-voltage characteristics for the MIS capacitor 
without interface states has been described.*® 1°17 Capacitance dis- 
persion caused by interface state loss was measured by Terman’ to 
obtain interface state densities and time constants. Lehovec et al.,12
Berz,13 and Sah15 developed equivalent circuits for a hypothetical
single-level interface state from Shockley-Read statistics.18 The equiv-
alent circuits given in this previous work are either too general12,13,15
or the measurement technique too inaccurate7 to give a real and com-
plete description of interface state properties. To describe the MIS
conductance technique and interpret the measurements in the depletion 
and weak inversion regions, we shall rederive the theory for the ad- 
mittance of the MIS capacitor. The starting point will be the theory 
for majority carrier capture and emission by a single-level interface 
state first calculated by Lehovec and Slobodskoy.12

Experimental evidence shows that only capture and emission of 
majority carriers are important when measuring in the depletion 
region. The quasi Fermi level and Fermi level are identical for major- 
ity carriers. Thus, application of an ac signal simply results in a time 
varying Fermi function. From Shockley and Read,18 the capture rate
of electrons, taken as majority carriers, by a single-level interface state
is

$$R_n(t) = N_s c_n [1 - f(t)]\, n_s(t), \tag{1}$$

and the emission rate is

$$G_n(t) = N_s e_n f(t), \tag{2}$$

where N_s is the density of states, cm^-2; c_n the electron capture probability,*
cm^3/sec; e_n the electron emission constant, sec^-1; f(t) the Fermi function
at time t; and n_s(t) the electron density at the silicon surface at time t, cm^-3.

* This capture probability is the average over all states in the conduction band
near the silicon surface.

Net current density flowing is

$$i_s(t) = q N_s c_n [1 - f(t)]\, n_s(t) - q N_s e_n f(t), \tag{3}$$

where q is electronic charge in coulombs. Making the small-signal
approximation, the admittance is, as is shown in Appendix A,

$$Y_s = \frac{j\omega\,(q^2/kT)\, N_s f_0 (1 - f_0)}{1 + j\omega f_0 / (c_n n_{s0})}, \tag{4}$$
where j = √(-1); ω is the angular frequency of the ac signal, sec^-1;
k is Boltzmann's constant, eV × coul/°K; T is the absolute temperature, °K;
f_0 is the Fermi function established by the bias, f_0 = [1 + exp(u - u_s)]^-1;
and n_s0 is the electron density at the silicon surface established by the
bias, cm^-3.

Equation (4) is the admittance of a series RC network with capac-
itance C_s = q^2 N_s f_0 (1 - f_0)/kT and time constant τ = f_0/(c_n n_s0). Separat-
ing (4) into its real and imaginary parts, the equivalent parallel ca-
pacitance is

$$C_p = \frac{C_s}{1 + \omega^2\tau^2}, \tag{5}$$

and the equivalent parallel conductance is

$$G_p = \frac{C_s\,\omega^2\tau}{1 + \omega^2\tau^2}. \tag{6}$$
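
As a numerical illustration of (5) and (6), the following sketch evaluates the single-level expressions. It is a minimal example rather than part of the analysis: the function names are hypothetical, the physical constants are standard values in SI units (so that q^2 N_s/kT comes out in F/cm^2 when N_s is in cm^-2), and any parameter values passed in are illustrative only.

```python
Q = 1.602176634e-19    # electronic charge, coulombs
K_B = 1.380649e-23     # Boltzmann's constant, joules per kelvin


def interface_state_capacitance(n_s, f0, temp_k=300.0):
    """C_s = q^2 N_s f0 (1 - f0) / kT, in F/cm^2 for N_s in cm^-2."""
    return Q**2 * n_s * f0 * (1.0 - f0) / (K_B * temp_k)


def time_constant(f0, c_n, n_s0):
    """tau = f0 / (c_n n_s0), in seconds for c_n in cm^3/sec and n_s0 in cm^-3."""
    return f0 / (c_n * n_s0)


def parallel_capacitance(c_s, tau, omega):
    """Equivalent parallel capacitance of the series RC network, as in (5)."""
    return c_s / (1.0 + (omega * tau) ** 2)


def parallel_conductance(c_s, tau, omega):
    """Equivalent parallel conductance of the series RC network, as in (6)."""
    return c_s * omega**2 * tau / (1.0 + (omega * tau) ** 2)
```

At ωτ = 1 the quantity G_p/ω computed this way equals C_s/2, which is the property that makes the conductance peak a direct measure of C_s and τ for a single level.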





4.3 Equivalent Circuit of MIS Capacitor

Let Q_T be the total charge density at a given bias. Then

$$Q_T = Q_{sc} + Q_s + Q_f, \tag{7}$$

where Q_sc is the silicon space-charge density, coul/cm^2; Q_s the interface
state charge density, coul/cm^2; and Q_f the fixed-charge density in the
oxide, coul/cm^2. The ac current density, i_T(t), obtained by differentiating
(7) with respect to time is

$$i_T(t) = i_{sc}(t) + i_s(t), \tag{8}$$

where i_sc(t) and i_s(t) are the ac current densities charging the silicon
space-charge layer and the interface states, respectively. To obtain
i_sc(t) we have

$$i_{sc}(t) = \left(\frac{dQ_{sc}}{d\psi_s}\right)\left(\frac{d\psi_s}{dt}\right), \tag{9}$$

where ψ_s(t) = ψ_s0 + δψ_s is the silicon band bending in volts at time
t, ψ_s0 the silicon band bending established by the bias, and δψ_s =
a exp(jωt). From this, dψ_s/dt = jω δψ_s. Also, dQ_sc/dψ_s = C_D, the deple-
tion layer capacitance per cm^2. Substituting these into (9) we get

$$i_{sc} = j\omega C_D\,\delta\psi_s. \tag{10}$$

It is shown in Appendix A that i_s(t) given by (3) can be written

$$i_s(t) = Y_s\,\delta\psi_s, \tag{11}$$

where Y_s is defined by (4). Substituting (10) and (11) into (8), we have

$$i_T(t) = (j\omega C_D + Y_s)\,\delta\psi_s. \tag{12}$$

From (12), it is seen that C_D appears in parallel with the series RC
network of the interface states. The voltage, v_a = v_0 + δv_a, applied
to the capacitor divides between the silicon and the oxide film so that

$$v_a(t) = v_0 + \delta v_a = \psi_s(t) + \frac{Q_T}{C_{ox}}, \tag{13}$$

where v_0 is the dc bias, δv_a = b exp(jωt), and C_ox is the oxide capacitance
per cm^2. Differentiating (13) with respect to t to get the ac terms only,

$$j\omega\,\delta v_a = j\omega\,\delta\psi_s + \frac{i_T(t)}{C_{ox}}. \tag{14}$$

Substituting for δψ_s from (12),

$$\delta v_a = i_T(t)\left(Z_s + \frac{1}{j\omega C_{ox}}\right), \tag{15}$$

where

$$Z_s = (j\omega C_D + Y_s)^{-1}.$$

The bracketed term in (15) is the impedance of a circuit in which
C_ox is in series with Z_s. The equivalent circuit for the MIS capacitor
from (4), (12), and (15) is given in Fig. 6.

Fig. 6 - Equivalent circuit of MIS capacitor for capture and emission of
majority carriers by a single level interface state. $C_{ox}$ is the oxide
capacitance per unit area, $C_D$ the depletion layer capacitance per unit area,
$C_s$ the interface state capacitance per unit area, and $R_s$ the majority
carrier capture resistance. Bulk generation-recombination is zero.

The principle of the MIS conductance technique is easily illustrated
by this simplified equivalent circuit. The admittance of the capacitor
is measured by a bridge across the terminals a-x. The oxide capacitance
is measured in the region of strong accumulation. The admittance of
the network is then converted into an impedance. The reactance of
the oxide capacitance is subtracted from this impedance and the
resulting impedance converted back into an admittance. This leaves $C_D$
in parallel with the series RC network of the interface states. The
capacitance from (5) and (12) is

$$C_p = C_D + \frac{C_s}{1 + \omega^2\tau^2} \tag{16}$$

and the equivalent parallel conductance divided by $\omega$ from (6) is

$$\frac{G_p}{\omega} = \frac{C_s\,\omega\tau}{1 + \omega^2\tau^2}. \tag{17}$$
$G_p$ is divided by $\omega$ to make (17) symmetrical in $\omega\tau$. It should be noted
that (16) and (17) are equivalent to the Debye equations,! ?° for a
dielectric in which the polarization can relax exponentially with
characteristic time, $\tau$, after a change in applied electric field occurs.

Equation (16) describes the capacitance dispersion and is the basis
of Terman's method. To extract $C_s$ and $\tau$ from $C_p$ using (16), $C_D$
must be known. $C_D$ can be calculated’® using an estimated doping
density. The doping density is not accurately known near the silicon
surface because of pile up or depletion of the dopant during oxidation.
Equation (17) on the other hand depends only on the interface state
branch of the equivalent circuit. $G_p/\omega$ goes through a maximum when
$\omega\tau = 1$ which gives $\tau$ directly. The value of $G_p/\omega$ at the maximum is
$C_s/2$. Thus, equivalent parallel conductance corrected for $C_{ox}$ gives $C_s$
and $\tau$ directly from the measured conductance.
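
The extraction procedure just described can be sketched numerically. The following is a minimal illustration that assumes the single-level circuit of Fig. 6. The component values, the frequency grid, and the function names are hypothetical choices made for the example and are not taken from the measurements. The sketch synthesizes the terminal admittance, removes the oxide reactance, and reads $\tau$ and $C_s$ from the peak of $G_p/\omega$.

```python
# Sketch only: single-level circuit of Fig. 6 with assumed component values.

C_OX = 5.73e-8   # oxide capacitance, F/cm^2 (assumed)
C_D = 8.60e-8    # depletion layer capacitance, F/cm^2 (assumed)
C_S = 2.0e-8     # interface state capacitance, F/cm^2 (assumed)
TAU = 1.0e-3     # interface state time constant, s (assumed)


def interface_admittance(omega, c_s, tau):
    """Series RC branch of the interface states; its real part is (6)."""
    return 1j * omega * c_s / (1 + 1j * omega * tau)


def terminal_admittance(omega):
    """Admittance seen by the bridge: C_ox in series with (C_D parallel Y_s)."""
    y_silicon = 1j * omega * C_D + interface_admittance(omega, C_S, TAU)
    z_total = 1 / (1j * omega * C_OX) + 1 / y_silicon
    return 1 / z_total


def remove_oxide(y_measured, omega, c_ox):
    """Convert to impedance, subtract the oxide reactance, convert back.

    Returns (C_p, G_p/omega) of the silicon side, as in (16) and (17).
    """
    z_silicon = 1 / y_measured - 1 / (1j * omega * c_ox)
    y_silicon = 1 / z_silicon
    return y_silicon.imag / omega, y_silicon.real / omega


# Logarithmic grid of angular frequencies from 10 to 1e5 rad/s.
omegas = [10 ** (k / 50) for k in range(50, 251)]
curve = [(w, *remove_oxide(terminal_admittance(w), w, C_OX)) for w in omegas]

# Peak of G_p/omega: omega * tau = 1 gives tau, and the peak height is C_s / 2.
omega_peak, _, g_peak = max(curve, key=lambda row: row[2])
tau_est = 1 / omega_peak
c_s_est = 2 * g_peak
print(f"tau ~ {tau_est:.3e} s, C_s ~ {c_s_est:.3e} F/cm^2")
```

The resolution of the recovered $\tau$ is set by the spacing of the frequency grid, so a measured curve would normally be sampled more densely near the peak.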


4.4 Admittance of an Interface State Continuum 

The interface states are observed to be comprised of many levels so
closely spaced in energy that they cannot be distinguished as separate
levels. Thus, they appear as a continuum over the bandgap of the
silicon. These observations are documented in Section VII and
discussed in Section VIII and agree with those of other investigators.” 7?
A continuum of interface states appears to be characteristic of the
Si-SiO$_2$ interface. The admittance given by (4) for a single-level state
and the equivalent circuit in Fig. 6 must therefore be modified.

For a continuum of states at a finite absolute temperature, capture
and emission of majority carriers can occur by states located within a
few $kT/q$ on either side of the Fermi level. This results in a time
constant dispersion. The admittance of the continuum first calculated
by Lehovec?? is obtained by integrating (4) over the bandgap

$$Y_{ss} = j\omega\,\frac{q^2}{kT}\int \frac{N_{ss}\,f_o(1 - f_o)\,d\psi}{1 + j\omega f_o/c_n n_{so}}. \tag{18}$$


Here $N_{ss}$ is the density of interface states, cm$^{-2}$ eV$^{-1}$, and $\psi$ is energy
in eV. The integrand of (18) is sharply peaked about the Fermi level
with a width of about $kT/q$. Thus, (18) can be easily integrated if both
$N_{ss}$ and $c_n$, the capture probability,* do not vary very much with $\psi$ over
a range $kT/q$. Experimentally, it is found (see Section VII) that
neither varies appreciably over $kT/q$. Making the substitution
$f_o(1 - f_o) = (kT/q)(df_o/d\psi)$ transforms (18) into an integral over $f_o$.
Integrating from zero to unity yields

$$Y_{ss} = \frac{qN_{ss}}{2\tau_m}\ln(1 + \omega^2\tau_m^2) + j\,\frac{qN_{ss}}{\tau_m}\arctan(\omega\tau_m), \tag{19a}$$

where

$$\tau_m = \frac{1}{c_n n_{so}}. \tag{19b}$$

Concentrating on the real part of (19a), we have for the continuum

$$\frac{G_p}{\omega} = \frac{qN_{ss}}{2\omega\tau_m}\ln(1 + \omega^2\tau_m^2). \tag{20}$$


At no value of bias does (20) even closely fit experimentally measured
$G_p/\omega$ vs $\omega$ curves. As will be shown in the next section, (20) is just the
first step in deriving an expression for $G_p/\omega$ which fits the experiment
in depletion. To proceed further, it is necessary to consider separately
the depletion region and the weak inversion region. In depletion, the
time constant dispersion observed is far broader than given by (20).
In weak inversion, the time constant dispersion disappears and the
experimental $G_p/\omega$ vs $\omega$ curves are characterized by a single time
constant at each bias.
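
As a check on the step from (18) to (20), the integral over $f_o$ can be evaluated numerically and compared with the closed form. The sketch below assumes, as the text does, that $N_{ss}$ and the capture probability are constant over the range of integration. It normalizes $qN_{ss}$ to one and works in the variable $x = \omega\tau_m$; the function names and grids are illustrative assumptions.

```python
import math


def gp_over_omega_closed(x, q_nss=1.0):
    """Equation (20) with x = omega * tau_m."""
    return q_nss * math.log1p(x * x) / (2 * x)


def gp_over_omega_numeric(x, q_nss=1.0, steps=20000):
    """Real part of (18) divided by omega, integrated over f_o from 0 to 1.

    With omega * tau = x * f_o, (18) becomes
    Y_ss = j * omega * q * N_ss * integral of df_o / (1 + j * x * f_o).
    Dividing by omega and taking the real part gives G_p / omega.
    """
    h = 1.0 / steps
    total = 0j
    for i in range(steps):
        f_o = (i + 0.5) * h            # midpoint rule
        total += h / (1 + 1j * x * f_o)
    return (1j * q_nss * total).real


# Compare the two forms over four decades of omega * tau_m.
xs = [10 ** (k / 10) for k in range(-20, 21)]
max_rel_err = max(
    abs(gp_over_omega_numeric(x) - gp_over_omega_closed(x)) / gp_over_omega_closed(x)
    for x in xs
)

# Locate the maximum of (20) on a fine grid of omega * tau_m.
grid = [0.01 * k for k in range(1, 1001)]
x_peak = max(grid, key=gp_over_omega_closed)
print(max_rel_err, x_peak, gp_over_omega_closed(x_peak))
```

The scan shows that the continuum curve (20) peaks at a value of $\omega\tau_m$ different from the single-level condition $\omega\tau = 1$ of (17), which is one way to tell the two models apart on a measured curve.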


* This capture probability is now the average over all the interface states in
the range of the integral and all states in the conduction band near the silicon
surface.

4.5 Depletion Region

The model suggested here for the increased broadening of the time
constant dispersion observed experimentally (see Section VII) in the
depletion region assumes statistical fluctuations of surface potential
in the plane of the interface. Majority carrier density at the silicon
surface is related to surface potential by

$$n_{so} = N_D \exp u_s = n_i \exp(u_s - u_B) \quad \text{for n-type}$$

and

$$p_{so} = N_A \exp(-u_s) = n_i \exp[-(u_s - u_B)] \quad \text{for p-type}, \tag{21}$$


where $n_i$ is intrinsic carrier density, cm$^{-3}$, $u_B = \ln(n_i/N_D)$ for n-type
and $u_B = \ln(N_A/n_i)$ for p-type. $u_B$ is the potential difference between
mid-gap* and Fermi level in the quasi neutral region of the silicon in
units of $kT/q$, $N_D$ is the ionized donor and $N_A$ the ionized acceptor
density in the silicon, cm$^{-3}$, $u_s$ is the silicon band bending or surface
potential in units of $kT/q$. $u_s - u_B$ is the potential difference between
mid-gap and Fermi level at the silicon surface.

The relation between time constant and surface potential is from
(19b) and (21)

$$\tau_m = \frac{1}{c_n n_i}\exp[-(u_s - u_B)] \quad \text{for n-type}$$

and

$$\tau_m = \frac{1}{c_p n_i}\exp(u_s - u_B) \quad \text{for p-type}. \tag{22}$$
Equation (22) shows that small fluctuations in $u_s$ will cause large
fluctuations in $\tau_m$, and thus broadened time constant dispersion. If we
assume that the built-in charges and charged interface states are
randomly distributed in the plane of the interface, the electric field at the
silicon surface will fluctuate over the plane of the interface. This is
shown schematically in Fig. 7. Fluctuations in electric field will cause
corresponding fluctuations in surface potential. To express this
quantitatively,† we conceptually divide the plane of the interface into a
number of squares of equal area. The area of each square is the largest
area within which the surface potential is uniform. We shall call this
particular area a characteristic area. Now, the admittance of the
continuum given by (19) can be regarded as the result of integrating
the admittance of a single level (4) over all the levels located within
a characteristic area rather than within the entire area of the field
plate. The total admittance is obtained by integrating the
contribution from each characteristic area over all the characteristic areas
within the area of the field plate. This must be done for each bias and
frequency to generate the complete family of curves. The calculated
$G_p/\omega$ curves will be compared with the measured $G_p/\omega$ curves to see
whether the model agrees with observation. For simplicity, we shall
derive in detail only an expression for the real part of the interface
state admittance. The steps in deriving the imaginary part are similar.

* Although the intrinsic Fermi level is slightly above mid-gap in silicon, this
is neglected and intrinsic Fermi level is taken to be at mid-gap.

† This treatment is similar to the one used by W. Shockley in Ref. 23 for
calculations of the influence of fluctuations of doping on avalanche breakdown.

Let $P(N)$ be the probability that there are $N$ built-in charges and
charged interface states in a characteristic area. The number of
characteristic areas which contain between $N$ and $N + dN$ surface
charges is

$$d\nu = P(N)\,dN. \tag{23}$$

Fig. 7 - Cross section of MIS structure illustrating the random distribution of
built-in charges and charged interface states over plane of the interface. Field
lines are all terminated at edge of depletion layer rather than on ionized
acceptor impurities in the layer for simplicity.


The total $G_p/\omega$ is $G_p/\omega$ for the continuum of levels located in a
characteristic area (20) times the number of characteristic areas containing
between $N$ and $N + dN$ surface charges (23) integrated over all the
characteristic areas. To integrate this product over all the
characteristic areas, (20) is expressed as a function of $u_s$ using (22) and $P(N)$
transformed to $P(u_s)$, the probability that the surface potential in a
characteristic area is $u_s$. This transformation will be performed in two
steps. The first transformation is

$$P(Q) = P(N)\,dN/dQ, \tag*{(24)*}$$

where $Q$ is the density of built-in charges and charged interface states,
coul/cm$^2$. The second transformation is

$$P(u_s) = P(Q)\,dQ/du_s. \tag{25}$$

* See Ref. 24 for a discussion of transforming the probability density as a
function of one random variable to another.


The number of characteristic areas in which the surface potential
is between $u_s$ and $u_s + du_s$ is $P(u_s)\,du_s$. Multiplying this by (20) and
integrating, the total $G_p/\omega$ is

$$G_p/\omega = \frac{qN_{ss}}{2}\int_{-\infty}^{\infty}(\omega\tau_m)^{-1}\ln(1 + \omega^2\tau_m^2)\,P(u_s)\,du_s. \tag{26}$$

It is assumed in (26) that both interface state density and capture
probability are uniform over several $kT$ of bandgap energy. That these
are reasonable assumptions can be seen from Fig. 14 and 15.

Substituting (22) for p-type* into (26) to get (26) as a function
of $u_s$

$$G_p/\omega = \tfrac{1}{2}qN_{ss}\int_{-\infty}^{\infty}\exp(u_B - u_s - u_o)\,\ln\{1 + \exp[2(u_o + u_s - u_B)]\}\,P(u_s)\,du_s, \tag{27}$$

where $u_o = \ln(\omega/c_p n_i)$.

The next task is to obtain $P(u_s)$ from $P(N)$. Let $\bar{N}$ be the mean
number of built-in charges and charged interface states in a
characteristic area. When $\bar{N}$ is large, $P(N)$ is given by the Gaussian
approximation of a Poisson distribution

$$P(N) = (2\pi\bar{N})^{-1/2}\exp[-(N - \bar{N})^2/2\bar{N}]. \tag{28}$$

The characteristic area $\alpha$ is the ratio between the mean number of
surface charges $\bar{N}$ in $\alpha$ and their mean density $\bar{n}$: $\alpha = \bar{N}/\bar{n}$. From this,
the relation between $N$ and $Q$ required for the first transformation is

$$N = \alpha Q/q. \tag{29}$$

Combining (24), (28), and (29), we get

$$P(Q) = (2\pi\alpha\bar{Q}/q)^{-1/2}(\alpha/q)\exp[-\alpha(Q - \bar{Q})^2/2q\bar{Q}], \tag{30}$$

where $\bar{Q}$ is the mean of $Q$.

The transformation of $P(Q)$ to $P(u_s)$ consists of transforming an area
in the space made up of all the points $Q$ into an area in the space made
up of all the points $u_s$. This requires a single-valued relation between
$Q$ and $u_s$. A relation between $Q$ and $u_s$ which is not single valued is
obtained from (7) and (13). Retaining only the dc terms in (13) and
expressing surface potential in units of $kT/q = \beta^{-1}$, we get

$$Q = Q_f + Q_s = C_{ox}(v_o + u_s/\beta) + Q_{sc}(u_s). \tag*{(31)*}$$

* This part of the derivation is carried out for p-type because the sign of the
variables for n-type is negative making the analysis harder to follow.
$Q_{sc}(u_s)$ is the space-charge density which from Ref. 16 is

$$Q_{sc}(u_s) = (2q\epsilon_{si}N_A/\beta)^{1/2}[\exp(-u_s) + u_s - 1]^{1/2}, \tag{32}$$

where $\epsilon_{si}$ is the dielectric permittivity of silicon, farads/cm and $N_A$
the ionized acceptor density, cm$^{-3}$. Equation (31) is not a single-valued
relationship between $Q$ and $u_s$ because of (32). The difficulty in making
the transformation from $Q$ space to $u_s$ space imposed by (31) and (32)
is avoided by restricting the problem to the case where the fluctuations
$Q - \bar{Q}$ are very small. For small fluctuations, (31) can be differentiated
assuming $N_A$ and oxide thickness to be uniform to get

$$dQ = (C_{ox}/\beta)\,du_s + dQ_{sc}. \tag{33}$$

$dQ_{sc}$ is eliminated from (33) by using the relation
$dQ_{sc} = (C_D/\beta)\,du_s = (\epsilon_{si}/W\beta)\,du_s$ where $W$ is the space-charge width.
Because $\bar{Q}$ is given by (31) when $u_s = \bar{u}_s$, $dQ$ can be evaluated about
$\bar{Q}$ at each bias by setting $W = W(\bar{u}_s)$. With this, (33) becomes

$$dQ = \frac{1}{W(\bar{u}_s)\beta}\,[W(\bar{u}_s)C_{ox} + \epsilon_{si}]\,du_s. \tag{34a}$$

Equation (34a) is the transformation equation we seek from an element
of area in $Q$ space to the corresponding element in $u_s$ space. Replacing
$dQ$ and $du_s$ in (34a) by the small fluctuations $Q - \bar{Q}$ and $u_s - \bar{u}_s$,
respectively, we get

$$Q - \bar{Q} = \frac{1}{W(\bar{u}_s)\beta}\,[W(\bar{u}_s)C_{ox} + \epsilon_{si}]\,(u_s - \bar{u}_s). \tag{34b}$$

Combining (25), (30), (34a), and (34b), we get

$$P(u_s) = (2\pi\sigma_s^2)^{-1/2}\exp[-(u_s - \bar{u}_s)^2/2\sigma_s^2], \tag{35}$$

where the standard deviation $\sigma_s$ is

$$\sigma_s = \frac{W(\bar{u}_s)\beta}{W(\bar{u}_s)C_{ox} + \epsilon_{si}}\left(\frac{q\bar{Q}}{\alpha}\right)^{1/2}. \tag{36}$$


* In this paper, $Q_f + Q_s$ will be calculated from flat band voltage shift
neglecting work function differences. Work function differences are neglected
because flat band voltage shifts of several volts independent of the field plate
metal are found in the samples used.

$\bar{Q}$ can be calculated from (31) and (32) at $u_s = \bar{u}_s$. In the samples
measured, the total number of built-in charges was more than an order
of magnitude greater than the total number of charged interface states.
Therefore, the bias dependence and the variation during each cycle of
the signal of the number of charged interface states are negligible.

Another cause of surface potential fluctuations which shall be taken
into account is the random distribution of ionized acceptors in the space
charge region. This is calculated in Appendix B using a model which
has previously been used to determine the fluctuations of breakdown
voltage.”’” Essentially, the procedure is similar to the one for the
surface charges except a characteristic cube having a side equal to $W$
is used instead of a characteristic area and $Q$ rather than $N_A$ is assumed
to be uniformly distributed. The standard deviation from Appendix B,
paragraph B.1 is


$$\sigma_B = \frac{q\beta\,[\bar{N}_A W(\bar{u}_s)]^{1/2}\,[1 - \exp(-\bar{u}_s)]}{2[W(\bar{u}_s)C_{ox} + \epsilon_{si}]}, \tag{37}$$

where $\bar{N}_A$ is the mean ionized acceptor density.

A theorem in statistics states’® that if two or more independent
variables are normally distributed their sum is also normally distributed
with a mean and variance which is the sum of the mean and variance
of each variable. Using this theorem, $P(u_s)$, which includes the influence
of both the randomly distributed surface charges and ionized acceptors
on the fluctuations of surface potential is

$$P(u_s) = [2\pi(\sigma_s^2 + \sigma_B^2)]^{-1/2}\exp[-(u_s - \bar{u}_s)^2/2(\sigma_s^2 + \sigma_B^2)]. \tag{38}$$

Substituting (38) into (27)

$$G_p/\omega = \tfrac{1}{2}qN_{ss}[2\pi(\sigma_s^2 + \sigma_B^2)]^{-1/2}\int_{-\infty}^{\infty}\exp[-(z + y)]\,\ln(1 + e^{2y})\,du_s, \tag{39}$$

where $y = \ln\omega\tau_m = u_o + u_s - u_B$ and $z = (u_s - \bar{u}_s)^2/2(\sigma_s^2 + \sigma_B^2)$.
By similar arguments using (19), the equivalent parallel capacitance is

$$C_p = C_D(\bar{u}_s) + qN_{ss}[2\pi(\sigma_s^2 + \sigma_B^2)]^{-1/2}\int_{-\infty}^{\infty}\exp[-(z + y)]\,\arctan(e^{y})\,du_s. \tag{40}$$

Depletion layer capacitance $C_D$ now becomes a function of $\bar{u}_s$. The
function $C_D(\bar{u}_s)$ has the same form as in Ref. 16 as long as the
fluctuations $Q - \bar{Q}$ are small.

It will be shown in Section VIII that the influence of the randomly
distributed ionized acceptors on the fluctuations of surface potential
is small in the samples measured compared to the influence of built-in
charges.

In summarizing the theory for the depletion region, it is shown how
the interface state branch of the equivalent circuit is developed. The
interface state branch of the equivalent circuit of Fig. 6 applies to a
single level in the continuum. In each characteristic area where the
surface potential is uniform, each level contributes a series RC network.
Integrating the admittance of each level given by (4) over all the levels
located in a characteristic area gives (19) and (20). Integrating again,
but this time the contribution to the total admittance from each
characteristic area over all the characteristic areas, gives (39) and (40).
The interface state branch of the equivalent circuit, therefore, can be
represented by an infinite number of series RC networks connected in
parallel as illustrated in Fig. 8(a).
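
The broadening expressed by (39) can be illustrated by direct numerical integration. The sketch below is written for p-type, where $y = u_o + u_s - u_B$ changes one-for-one with $u_s$. It normalizes $qN_{ss}$ to one and treats the combined standard deviation $(\sigma_s^2 + \sigma_B^2)^{1/2}$ as a single free parameter. The values of the standard deviation, the integration limits, and the function names are assumptions made for the example, not fitted quantities.

```python
import math


def gp_over_omega_broadened(y_mean, sigma, q_nss=1.0, half_width=8.0, steps=2000):
    """Equation (39) by midpoint integration over u_s.

    y_mean is ln(omega * tau_m) evaluated at the mean surface potential and
    sigma is the combined standard deviation sqrt(sigma_s^2 + sigma_B^2).
    """
    h = 2 * half_width * sigma / steps
    total = 0.0
    for i in range(steps):
        du = -half_width * sigma + (i + 0.5) * h   # u_s minus its mean
        y = y_mean + du                             # ln(omega * tau_m) at this u_s
        z = du * du / (2 * sigma * sigma)
        total += math.exp(-(z + y)) * math.log1p(math.exp(2 * y)) * h
    return 0.5 * q_nss * total / math.sqrt(2 * math.pi * sigma * sigma)


def gp_over_omega_uniform(y, q_nss=1.0):
    """Equation (20) written in terms of y = ln(omega * tau_m)."""
    return 0.5 * q_nss * math.exp(-y) * math.log1p(math.exp(2 * y))


# Scan ln(omega * tau_m) from -10 to 10 and report the peak for each sigma.
ys = [0.25 * k for k in range(-40, 41)]
peaks = {}
for sigma in (0.5, 1.0, 2.0):
    peaks[sigma] = max(gp_over_omega_broadened(y, sigma) for y in ys)
    print(f"sigma = {sigma}: peak G_p/omega = {peaks[sigma]:.3f} (units of q*N_ss)")
```

As the standard deviation grows, the peak of $G_p/\omega$ falls and the curve widens, which is the behavior the model uses to account for the dispersion observed in depletion.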


Fig. 8 - (a) Equivalent circuit for depletion region showing time constant
dispersion caused primarily by statistical fluctuations of surface potential.
$C_{ox}$ is the oxide capacitance per unit area, and $C_D$ the depletion layer
capacitance per unit area. Each subnetwork consisting of $C_s$ and $R_s$ in series
represents time constant of the continuum of interface states in a
characteristic area. Bulk generation-recombination is zero. (b) Simplified
version of (a) useful when extracting $C_{ox}$ from the measured admittance.
$C_p(\omega)$ is the capacitance per unit area at a given bias and frequency of the
distributed network in parallel with $C_D$. $G_p(\omega)$ is the equivalent parallel
conductance per unit area.


4.6 Weak Inversion


For n-type, the silicon surface will be in weak inversion when the
Fermi level is more than a few $kT/q$ below mid-gap in the lower half
of the bandgap. The minority carrier density at the silicon surface will
now be orders of magnitude greater than the majority carrier density
at the silicon surface. The minority carriers can no longer be ignored
as in depletion and the equivalent circuit of Fig. 8(a) does not apply.
To derive expressions for the interface state admittance we start again
by considering a single level state. The capacitance associated with
this state is $C_s = q^2N_sf_o(1 - f_o)/kT$ and its time constant for majority
carrier transitions (electrons) is $\tau_{n,s} = f_o/c_n n_{so}$ as derived previously
in paragraph 4.2. The majority carrier capture resistance from this is

$$R_{n,s} = \tau_{n,s}/C_s = (kT/q^2)[c_n n_{so} N_s(1 - f_o)]^{-1}. \tag{41}$$


By similar reasoning, it can be shown that the minority carrier
(holes) capture resistance is

$$R_{p,s} = (kT/q^2)[c_p p_{so} N_s f_o]^{-1}, \tag{42}$$

where $c_p$ is the capture probability for holes, cm$^3$/sec, and $p_{so}$ is the
hole density, cm$^{-3}$, at the silicon surface when the Fermi level is at the
interface state level.

The equivalent circuit for a single level is given in Fig. 9(a). From
(41) and (42)

$$R_{n,s}/R_{p,s} = (c_p/c_n)\exp(u_F - 3u). \tag{43}$$

Equation (43) has been obtained by substituting $p_{so} = n_i\exp(-u)$,
$n_{so} = n_i\exp(u)$, and $f_o = [1 + \exp(u - u_F)]^{-1}$ into (41) and (42).
$u$ is the potential difference between mid-gap and the interface state
level in units of $kT/q$ and $u_F$ is the potential difference between mid-gap
and the Fermi level at the silicon surface in units of $kT/q$. There will
be no detectable loss due to transitions of electrons and holes to and
from interface states unless $u \approx u_F$ because $f_o(1 - f_o)$ is so sharply
peaked around the Fermi level. Equation (43) becomes

$$R_{n,s}/R_{p,s} = (c_p/c_n)\exp(-2u_F). \tag{44}$$

It is seen from (44) that $R_{n,s}$ will rapidly become orders of magnitude
greater than $R_{p,s}$ when the Fermi level gets more than a few $kT/q$
below mid-gap even if $c_p$ is one or two orders of magnitude smaller
than $c_n$. Time constant for majority carrier transitions is $\tau_{n,s} = C_sR_{n,s}$,
and for minority carrier transitions it is $\tau_{p,s} = C_sR_{p,s}$. Thus, $\tau_{n,s}$ will
be much greater than $\tau_{p,s}$ in weak inversion. If the period of the applied
signal frequency is comparable to $\tau_{n,s}$ as it is in these experiments,
then there will be virtually no loss associated with minority carrier
transitions. Each interface state near the Fermi level is charged and
discharged instantaneously by minority carriers so that $R_{p,s} \approx 0$.
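
The size of the ratio in (43) and (44) is easy to tabulate. The sketch below evaluates the ratio from the substitutions quoted above and confirms that it reduces to (44) for a level at the Fermi level. The ratio $c_p/c_n = 0.01$ and the listed values of $u_F$ are assumed purely for illustration, and the function names are hypothetical.

```python
import math


def occupancy(u, u_f):
    """Fermi function f_o for a level at u with the surface Fermi level at u_f."""
    return 1.0 / (1.0 + math.exp(u - u_f))


def resistance_ratio(u, u_f, cp_over_cn):
    """R_ns / R_ps from (41) and (42) with p_so = n_i exp(-u), n_so = n_i exp(u)."""
    f_o = occupancy(u, u_f)
    holes = cp_over_cn * math.exp(-u) * f_o      # proportional to c_p * p_so * f_o
    electrons = math.exp(u) * (1.0 - f_o)        # proportional to c_n * n_so * (1 - f_o)
    return holes / electrons


def resistance_ratio_at_fermi_level(u_f, cp_over_cn):
    """Equation (44): the ratio for a level at the Fermi level, u = u_f."""
    return cp_over_cn * math.exp(-2.0 * u_f)


CP_OVER_CN = 0.01     # assumed: hole capture probability 100 times smaller
for u_f in (-2.0, -5.0, -8.0):   # Fermi level below mid-gap, in units of kT/q
    ratio = resistance_ratio_at_fermi_level(u_f, CP_OVER_CN)
    print(f"u_F = {u_f}: R_ns/R_ps = {ratio:.3g}")
```

Even with the capture probability ratio working against it, the exponential factor dominates once the Fermi level is several $kT/q$ below mid-gap.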

Because $R_{p,s}$ is negligibly small, it can be replaced by a short circuit
which connects the interface state capacitance directly to the minority
carrier (valence) band as shown in Fig. 9(b). Those interface states
in the continuum which are located in a given characteristic area each
have a different time constant. These interface states must be
represented by a distributed network such as Fig. 9(c). Only those levels
in a narrow range of energy about $kT$ wide centered about the Fermi
level participate in the capture and emission process. Therefore, the
corresponding range of time constants is narrow. The large minority
carrier density at the surface then means that the minority carrier
capture resistance $R_{p,s}$ of every one of these levels is negligibly small.
Thus, the capacitance corresponding to each of these levels can be
connected directly to the minority carrier band. The large inversion
layer conductance shorts all the characteristic areas together. The net
result is represented in Fig. 9(d) which can be further simplified to
Fig. 9(e). $G_p/\omega$ vs $\omega$ now will be characterized by a single time
constant at each bias. The expression for $G_p/\omega$ is (17) and for $C_p$ it is (16).
The value of $C_s$ in (16) and (17) is the sum of all the individual interface
state capacitances in parallel as shown in Fig. 9(c). The inversion
layer capacitance $C_I$ calculated in weak inversion using the relation
given in Ref. 12 is negligible compared to $\Sigma C_s$ obtained from
measurement.

Fig. 9 - (a) Equivalent circuit for a single level interface state in region of
weak inversion. $C_s$ is the interface state capacitance per unit area, $R_{n,s}$ the
capture resistance for electrons, and $R_{p,s}$ the capture resistance for holes. CB
represents the conduction band and VB the valence band. (b) Equivalent circuit
for a single level interface state for the case in which minority carriers are
holes which respond instantaneously to the applied signal. For this
representation, it is also necessary that the lateral conductance due to minority
carriers along the surface is large compared to $R_{n,s}^{-1}$. (c) Circuit (a)
generalized to include the fact that there is a continuum of states across the
bandgap of silicon. $C_I$ is the inversion layer capacitance per unit area.
(d) Circuit (c) simplified by fast response of minority carriers. $C_I$ is
negligible in the region of weak inversion. (e) Simplification of circuit (d).

If interface state density $N_{ss}$ is slowly varying with the position of
the Fermi level, the statistical fluctuations of surface potential will
have negligible effect on the total $C_s$. However, in analyzing the
measurements on n-type samples in weak inversion using (17), it is found
that $C_s$ varies quite rapidly with position of the Fermi level (see Fig. 27).
This implies that $N_{ss}$ is also rapidly varying so that the statistical
fluctuations of surface potential must be taken into account when
calculating the relation between $N_{ss}$ and total $C_s$. Total capture
conductance is the sum of the capture conductances $R_{n,s}^{-1}$ of the individual
interface states in parallel as shown in Fig. 9(e).

The statistical fluctuations of surface potential must be taken into
account in calculating the total capture conductance because capture
conductance depends on $n_{so}$ as well as the rapidly varying $N_{ss}$. If
desired, $C_s$ can be calculated by integrating $C_s$ for a single level over
bandgap energy and then over surface potential using the statistical
model developed earlier. Similarly, total capture conductance can be
calculated by integrating capture conductance for a single level
obtained from (41) over bandgap energy and then over surface potential.
These integrals cannot be evaluated without knowing the functional
dependence of $N_{ss}$ on the position of the Fermi level. This dependence
can be found by numerically fitting the calculations to the
measurements. Although $N_{ss}$ and majority carrier capture probability can be
obtained in this way from measured $C_s$ and $\tau_{n,s}$, such an analysis will
not be carried out here.

A comparison between the depletion and weak inversion regions
will illustrate the processes occurring. In depletion except very near
mid-gap, majority carrier density at the silicon surface is several
orders of magnitude greater than minority carrier density at the silicon
surface. Therefore, the time constant for minority carrier transitions
is much greater than the time constant for majority carrier transitions.
The time constant for majority carrier transitions is comparable to
the period of the applied signal frequency. Therefore, minority carrier
density cannot follow the signal at all and ac current flows simply by
transitions between interface states near the Fermi level and the
majority carrier band. Thus, the interface state branch of the equivalent
circuit in Fig. 8(a) can be obtained by open circuiting $R_{p,s}$ in Fig. 9(c).

In the weak inversion region except very near mid-gap, again the
time constant for majority carrier transitions is comparable to the
period of the signal frequency and several orders of magnitude greater
than the time constant for minority carrier transitions. Now both
majority and minority carrier densities respond to the signal so that
ac current flows by generation and recombination through interface
states near the Fermi level. This current flow is controlled by the
majority carrier capture resistance as it is the largest. In this range,
the interface states can easily communicate with the minority carrier
band but the minority carrier band has no connection with the bulk.
Thus, the main effect minority carriers have in the equivalent circuit
is to tie all $R_{n,s}C_s$ together as shown in Fig. 9(d), thereby eliminating
the time constant dispersion.

In the region around mid-gap where majority and minority carrier
densities at the silicon surface are comparable, no simplification is
possible and the equivalent circuit of Fig. 9(c) holds. This region is
only a few $kT/q$ wide and no analysis was attempted of measurements
in this region.


V. MEASUREMENT APPARATUS, REPRODUCIBILITY AND SAMPLE DRIFT 


5.1 Measurement Apparatus 

The measurement of the equivalent parallel conductance of an MIS 
capacitor for the purpose of extracting interface state properties re- 
quires consideration of the following points which are much more im- 
portant in conductance measurements than in capacitance measure- 
ments alone. 

(i) Harmonics of the signal frequency due to the non-linearity of the
charge-voltage characteristic of the MIS capacitor can give rise to a
conductance. To insure that this is negligible and the conductance
measured is due to interface state transitions only, the maximum
swing of surface potential caused by the applied ac signal should be
less than $kT/q$ volts.

(ii) The conductance measurements reported here are in the range
between a nmho and about 10 μmhos which corresponds to very small
loss angles. Such small conductance values require that current leakage
paths along the air-oxide interface be minimized.

(iii) To verify the theory developed in the previous section,
capacitance, equivalent parallel conductance, and bias must be measured
very accurately because of the exponential dependence in (22).

The slices with the completed MIS capacitors were placed on a brass 
pedestal capped with platinum in a light-tight grounded box for 
measurement. A steady stream of dry nitrogen was supplied to the box. 
This keeps the oxide surface dry thereby preventing the formation of a 
current path for the ac signal along the oxide surface from the field 
plate to the back contact. It is especially important when making con- 
ductance measurements that such current paths be eliminated. 

To avoid scratching the field plate and thereby reducing its area, a
gold wire, 125 μ in diameter, mounted on a micromanipulator was
used to make contact to the field plate.

For high sensitivity, it is necessary to measure accurately con- 
ductance when the loss angle is very small. The most suitable instru- 
ment for doing this over a wide range of frequencies is a capacitance 
bridge. The General Radio 1615-A capacitance bridge was found
satisfactory over the frequency range from 50 Hz to 20 kHz and the
Boonton 75-C capacitance bridge from 5 kHz to 500 kHz. Three
terminal capacitance measurements were made with these bridges. Fig. 10
is a block diagram showing the arrangement.

There are three parameters which require special attention in these
measurements. They are (i) frequency, (ii) signal amplitude, and (iii)
dc bias.


5.1.1 Frequency 


Frequency must be measured with a counter so that the precision 
in the calculated admittance of interface states and silicon is limited 
only by the precision of the bridges. The calibration marks on the 
oscillator dial in either the Boonton bridge or the hp signal generator 
are not accurate enough to obtain frequency (see Section VI). 


Fig. 10 - Block diagram of measuring apparatus. Block labels: MOS bias supply;
GR-1232A; null detector; GR-1230A; sample; hp 200 CD oscillator; frequency
counter.


5.1.2 Signal Amplitude


To keep harmonics of the signal frequency from giving rise to a
spurious conductance, only signals of small amplitude can be
applied. The small signal range is determined by experiment. Fig. 11 is a
plot of measured capacitance and equivalent parallel conductance
normalized to their largest values as functions of peak-to-peak signal
amplitude at a bias corresponding to peak conductance at 100 Hz,
measured at a point where the slope of the MIS capacitance is large.
There is a range of low signal amplitude at which both capacitance and
equivalent parallel conductance are independent of signal amplitude.
This is the experimentally determined small signal range. The
maximum amplitude of the ac silicon surface potential produced by this
signal is about two fifths the applied signal amplitude.

To see this, note that the applied signal divides between the oxide
layer and the silicon. Expressing this mathematically, substitute (12)
into (15):

$$\delta\psi_s = \delta v_a\left[1 + \frac{j\omega C_D + Y_s}{j\omega C_{ox}}\right]^{-1}. \tag{45}$$


For a fixed signal amplitude, the amplitude of the silicon surface
potential varies with bias because $C_D$ and $Y_s$ are functions of bias.
Maximum silicon surface potential amplitude occurs when the Fermi
level is near mid-gap because that is where $C_D$ and $|Y_s|$ are smallest.
The reason $|Y_s|$ is small is that the majority carrier density is small
making the interface state time constant long. Thus, to a good
approximation $|\omega(C_{ox} + C_D)| \gg |Y_s|$ so that (45) becomes

$$\frac{\delta\psi_s}{\delta v_a} = \frac{C_{ox}}{C_{ox} + C_D}. \tag{46}$$

Typical values are: $C_{ox} = 5.73 \times 10^{-8}$ farads/cm$^2$ and
$C_D = 8.60 \times 10^{-8}$ farads/cm$^2$ which gives $\delta\psi_s/\delta v_a = 2/5$.

At the maximum sensitivity setting of the null amplifier, null is
found in the small signal range at a null meter reading of 1/2 μA on
the Boonton bridge and +8/4 μV on the General Radio bridge. These
are the null readings determined essentially by the noise in the
system. Therefore, harmonics of the signal frequency arising from the
nonlinearity of the charge-voltage characteristics of the MIS capacitor
are unimportant. However, as signal amplitude is increased beyond
this range, nulls are obtained at successively higher null meter
readings. This means that the amplitude of the harmonics has become
large enough to get through the tuned null amplifier. In this range of
signal amplitude, the conductance peak decreases in magnitude and the
conductance curve is broadened.

Operating in the small signal range entails a reduction in bridge
sensitivity. Therefore, optimum signal amplitude is at the high end of
the small signal range. The small signal range is determined by the
oxide thickness and silicon resistivity. This range extends to higher
signal amplitudes as the oxide is made thicker or silicon resistivity
reduced. For oxide thicknesses between 500 Å and 700 Å and silicon
resistivities around 1 ohm cm, which is typical of the samples
measured in this experiment, optimum peak-to-peak signal amplitude is
found to be 50 mV. For other oxide thicknesses and silicon resistivities,
optimum peak-to-peak signal amplitude can be found by repeating the
measurements shown in Fig. 11. An external attenuator is needed on
the hp signal generator to get an amplitude of 50 mV. Signal amplitude
is measured with an oscilloscope and adjusted whenever frequency is
changed. The maximum peak-to-peak silicon surface potential with this
signal is 2/5 × 50 = 20 mV which is less than $kT/q$ at 300°K and
therefore small signal.

Fig. 11 - (a) Normalized capacitance and (b) normalized equivalent parallel
conductance at fixed bias. The maximum value of capacitance is 367.8 pF and
conductance 58.8 nmhos. Frequency is 100 Hz. Bias corresponds to peak
conductance at this frequency. Axes: $C_m/C_{max}$ or $G_m/G_{max}$ versus peak-to-peak
signal amplitude in mV.


5.1.3 DC Bias 


Because interface state time constants are an exponential function 
of silicon surface potential, dc bias must be measured to at least three
decimal places. The Cimron 6900A digital voltmeter is more than 
adequate for this purpose. 

DC bias is supplied to the MIS capacitor through a voltage divider 
in each bridge. Therefore, it must be measured across the sample 
rather than at the supply terminals. The voltmeter puts an additional 
capacitance and conductance across the sample which are read by the 
bridge. To avoid the correction this entails, the voltmeter is switched 
out of the circuit before the bridge is balanced. Because the input 
resistance of the voltmeter no longer shunts the MIS capacitor, the 
current through the voltage divider in the bridge decreases. Therefore, 
the voltage across the MIS capacitor is slightly larger than that meas- 
ured. For the Cimron 6900A digital voltmeter which has an input 
impedance when balanced of 10° ohms, this is not a problem. However, 
for a digital voltmeter having an input impedance of a few megohms, 
this correction becomes apparent whenever the resistance values of the 
internal bridge voltage divider are changed. These changes occur when 
switching the conductance multiplier on the Boonton bridge, the 
conductance to loss factors switch on the General Radio bridge or in 
switching from one bridge to another. To eliminate this error, a Gen- 
eral Radio 1230A electrometer which has an input resistance of 107° 
ohms with the resistance selector set at ∞ can be used to measure
bias voltage across the sample. The output of the electrometer is then 
read by the digital voltmeter. 

To permit control of the bias voltage to within a fraction of a
millivolt, a voltage divider consisting of a bank of fixed resistors in
series with a 10-turn helipot is used in conjunction with a heavy duty
45-volt battery in a shielded box.


5.2 Reproducibility of Data and Sample Drift 


Sample preparation is designed to eliminate drift under negative 
bias at room temperature only. The admittance-voltage characteristics 
in the range of interest for both n- and p-type samples are located in 
the negative bias range because of the presence of a fixed residual 
positive charge in the oxide. Therefore, n-type samples are in strong 
accumulation and p-type in strong inversion at positive bias. No con- 
ductance measurements are made at positive bias because conductance 
due to interface states is too low to detect in strong accumulation or 
inversion. 

Reproducibility of the measurements on a given MIS capacitor was 
assured as follows. Measurements are repeated after room temperature 
bias aging at high negative bias, after the lapse of several days, and by 
hysteresis measurements where the admittance-voltage characteristics 
are measured first with increasing bias and then with decreasing bias. 
Only those samples where the measurements were found to be repro- 
ducible within experimental error are used. Interface state properties 
for such capacitors on a few different slices are found to be similar. 


VI. TECHNIQUE FOR ANALYZING THE DATA 


6.1 Properties of Equivalent Parallel Conductance 


The variation of G_p/ω with bias or frequency can be most easily seen
from (17) and (22) for a single time constant. In (39), derived from
the statistical model which fits the measurements exactly in the
depletion region (see Fig. 21), the variation of G_p/ω with bias and
frequency is obscured by the mathematical complexity. Equation (17)
is adequate for illustrative purposes because of its simplicity and the
fact that its variation with bias and frequency is qualitatively the
same as (39) in the depletion region. Equation (17) fits the data
exactly in the weak inversion region (see Fig. 25). G_p or G_p/ω can be
obtained from measurements as a function of bias with frequency
as parameter or as a function of frequency with bias as parame-
ter. G_p or G_p/ω will go through a peak as a function of bias at
each frequency as seen from (17) and (22). For a continuum of states,
this peak occurs when the response time of the interface states, which
is being varied by the bias, equals the period of the applied signal
frequency multiplied by a constant. When the frequency is increased,
this peak shifts to a bias value corresponding to a surface potential
nearer to flat bands. Fig. 1 shows capacitance and equivalent parallel
conductance measured at 5 kHz and 100 kHz in a wet oxide. This
figure illustrates how measured equivalent parallel conductance varies
with bias and frequency. Fig. 1 also shows how inaccurate the capaci-
tance method for extracting information about interface states is
compared to the conductance method. It can be noted from Fig. 1 that

(i) Maximum capacitance dispersion in this frequency range is
about 14 percent while equivalent parallel conductance changes 
more than an order of magnitude. 

(ii) The conductance peak moves towards a bias voltage closer to
flat band bias which in Fig. 1 is —2.6 volts as frequency is increased. 
Between 50 Hz and 500 kHz, conductance does not peak at flat band 
bias in the samples measured.* The peak shifts over the bias range 
from weak inversion to depletion in this frequency range. 

(iii) Capacitance and equivalent parallel conductance are related to
each other by the Kronig-Kramers relations. Rather than going
through a mathematical proof of this, we note that (16), (17), (39),
and (40) have the same form as the Debye equations.?* ?° Because the 
Debye equations satisfy the Kronig-Kramers relations,?’ we infer that 
(16), (17), (39), and (40) do also.

G_p/ω as a function of frequency will go through a peak at each value
of bias but in this case G_p will not. It can be seen from (17) that only
G_p/ω is symmetric in ωτ when frequency is the variable while both
G_p and G_p/ω are symmetric in ωτ when bias is the variable.

G_p/ω vs bias (with frequency as parameter) will be spread over a
bias range determined by the time constant dispersion and the density
of interface states. The density of states spreads the G_p/ω vs bias curve
in the same way that it spreads the high-frequency capacitance vs
bias curve. However, G_p/ω vs ω (with bias as parameter) will be spread
over a frequency range determined only by the time constant disper-
sion. Because time constant dispersion determines the interface state
admittance, G_p/ω vs ω curves are the more useful of the two for the
purposes of this work.
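
The single-time-constant behavior described above can be illustrated with a short sketch. It assumes that (17) has the Debye form G_p/ω = C_s ωτ/(1 + ω²τ²), which is consistent with the peak condition ωτ = 1 and the peak value C_s/2 quoted for weak inversion. The value of C_s is hypothetical.

```python
import numpy as np


def gp_over_omega(omega_tau, c_s):
    """Assumed Debye form of (17): G_p/omega = C_s * wt / (1 + wt^2)."""
    return c_s * omega_tau / (1.0 + omega_tau**2)


def gp_times_tau(omega_tau, c_s):
    """G_p * tau for the same model: C_s * wt^2 / (1 + wt^2)."""
    return c_s * omega_tau**2 / (1.0 + omega_tau**2)


C_S = 1.6e-8  # hypothetical interface-state capacitance, farads/cm^2

# Sweep omega*tau at fixed tau, i.e. frequency is the variable.
omega_tau = np.logspace(-3, 3, 601)
loss = gp_over_omega(omega_tau, C_S)
conductance = gp_times_tau(omega_tau, C_S)

peak_position = omega_tau[np.argmax(loss)]   # expected near omega*tau = 1
peak_value = loss.max()                      # expected near C_s / 2

# G_p/omega is symmetric on a logarithmic omega*tau axis ...
is_symmetric = np.allclose(loss, gp_over_omega(1.0 / omega_tau, C_S))
# ... while G_p itself rises monotonically with frequency and has no peak.
is_monotonic = bool(np.all(np.diff(conductance) > 0))

print(peak_position, peak_value / C_S, is_symmetric, is_monotonic)
```

In this model only G_p/ω peaks when frequency is swept at fixed bias, which is why G_p/ω vs frequency curves are the convenient quantity for the analysis.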


* To get the conductance to peak at flat band bias, frequencies higher than 500
kHz have to be applied.


6.2 Extraction of G_p/ω

It is necessary first to get G_p/ω and C_p from the measurements. C_p
and G_p are shown in Fig. 8(b) to be the capacitance and equivalent
parallel conductance of the portion of the equivalent circuit consisting
of C_D in parallel with the distributed network representing the inter-
face states. The admittance measured across the terminals x-x in
Fig. 8(a) is G_m + jωC_m. Converting this to an impedance, subtracting
away the reactance of C_ox, and converting back to an admittance, we
get


$$\frac{G_p}{\omega} = \frac{\omega C_{ox}^2 G_m (G_m^2 + \omega^2 C_m^2)}{\omega^2 C_{ox}^2 G_m^2 + [\omega^2 C_m (C_{ox} - C_m) - G_m^2]^2} \qquad (47)^{*}$$

and

$$C_p = \frac{C_{ox} (G_m^2 + \omega^2 C_m^2)[\omega^2 C_m (C_{ox} - C_m) - G_m^2]}{\omega^2 C_{ox}^2 G_m^2 + [\omega^2 C_m (C_{ox} - C_m) - G_m^2]^2}. \qquad (48)^{*}$$


The measured G_p/ω from (47) is related to N_ss and τ_m by (39) and C_p
from (48) is related to C_D, N_ss, and τ_m by (40) in the depletion region.
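
The correction for the oxide capacitance can be sketched as follows. The first function applies the closed forms corresponding to (47) and (48), written here as derived from the series connection of C_ox with the parallel branch. The second is a forward model used only to check the algebra. Function names and the numerical values in the round trip are hypothetical.

```python
import math


def extract_parallel_branch(omega, c_ox, c_m, g_m):
    """Return (G_p/omega, C_p) from measured C_m and G_m.

    Implements the series-C_ox correction: convert G_m + j*omega*C_m to an
    impedance, subtract the reactance of C_ox, convert back to an admittance.
    """
    common = omega**2 * c_m * (c_ox - c_m) - g_m**2
    denom = omega**2 * c_ox**2 * g_m**2 + common**2
    mag2 = g_m**2 + omega**2 * c_m**2
    gp_over_omega = omega * c_ox**2 * g_m * mag2 / denom
    c_p = c_ox * mag2 * common / denom
    return gp_over_omega, c_p


def measured_admittance(omega, c_ox, c_p, g_p):
    """Forward model: C_ox in series with (C_p in parallel with G_p).

    Returns the capacitance and conductance a bridge would read (C_m, G_m).
    """
    y_p = g_p + 1j * omega * c_p
    y_ox = 1j * omega * c_ox
    y_m = y_ox * y_p / (y_ox + y_p)
    return y_m.imag / omega, y_m.real


# Hypothetical branch values per unit area, used only for the round trip.
OMEGA = 2 * math.pi * 1.0e3   # rad/sec
C_OX = 5.73e-8                # farads/cm^2
C_P_TRUE = 9.0e-8             # farads/cm^2
GP_OVER_OMEGA_TRUE = 5.0e-9   # farads/cm^2

c_m, g_m = measured_admittance(OMEGA, C_OX, C_P_TRUE, GP_OVER_OMEGA_TRUE * OMEGA)
gp_over_omega, c_p = extract_parallel_branch(OMEGA, C_OX, c_m, g_m)
print(gp_over_omega, c_p)
```

Because the correction depends on the difference C_ox - C_m, errors in C_ox and in ω propagate directly into the extracted values, which is why a thin oxide and an accurately counted frequency are helpful.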

It should be noted that the circuit across the terminals x-x in
Fig. 8(a) has a shorter time constant than that of just the interface
state branch of the circuit because of C_ox. Therefore, G_p/ω from (47)
will peak at a lower frequency than G_m/ω measured across terminals
x-x when equivalent parallel conductance is measured as a function
of frequency with bias as parameter. If equivalent parallel conductance
is measured as a function of bias with frequency as parameter, G_p/ω
from (47) will peak at a bias closer to flat bands for the same reason,
as can be seen from (22).

To minimize errors introduced by correcting for C_ox, the oxide
should be made as thin as practicable and frequency measured ac-
curately with a counter as described in Section V.


6.3 Bias vs Surface Potential 


From (22), there is an exponential relation between interface state
time constant and surface potential. Therefore, bias is measured with
a digital voltmeter as described in Section V and the relation between
bias and surface potential is determined as accurately as possible. It
is necessary to do this to prove that (22) fits the measurements and
to extract c_p and c_n accurately. One of the most accurate methods for
obtaining the relation between bias and surface potential has been
developed by C. N. Berglund (Ref. 28). In this method, capacitance is
measured as a function of bias at such a low frequency that the interface
states are in equilibrium with the silicon. This means that the loss
associated with the capture and emission of carriers by the interface
states is negligible.


* Equations (47) and (48) were evaluated using the measured values of ω,
C_ox, C_m, and G_m on an IBM 1620 computer.

Bias vs surface potential is obtained in two steps from the measured 
data. First, surface potential is found to within an additive constant 
at each bias. Then, the additive constant is found. 


6.3.1 Surface Potential within an Additive Constant 
Low-frequency capacitance C_lf measured at bias v_a is

$$C_{lf} = dQ_T/dv_a. \tag{49}$$

Differentiating (13) with respect to v_a,

$$1 = d\psi_s/dv_a + (1/C_{ox})\, dQ_T/dv_a. \tag{50}$$

From (49) and (50)

$$d\psi_s/dv_a = 1 - C_{lf}/C_{ox}. \tag{51}$$

The surface potential at any bias v_a1 is found by integrating (51)

$$\psi_s(v_{a1}) = \int_{v_{a0}}^{v_{a1}} (1 - C_{lf}/C_{ox})\, dv_a + A, \tag{52}$$

where A is an additive constant and v_a0 is a bias in strong accumula-
tion.

Thus, surface potential at each bias can be obtained to within an 
additive constant directly from measured data by an integration. The 
fact that an integration is carried out using directly measured quanti- 
ties makes this method accurate. 
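
The integration in (52) can be sketched numerically. The example below is a minimal illustration, assuming the low-frequency capacitance has been sampled on a bias sweep that starts in strong accumulation. The function name and the synthetic data are hypothetical, and the trapezoidal rule is a choice made here, not something the text specifies for this step.

```python
import numpy as np


def surface_potential_within_constant(v_a, c_lf, c_ox):
    """Running trapezoidal integral of (1 - C_lf/C_ox) dv_a along the sweep.

    v_a  : bias values, ordered from strong accumulation onward
    c_lf : low-frequency capacitance at each bias (same units as c_ox)
    Returns the surface potential at each bias, up to the additive constant A.
    """
    v = np.asarray(v_a, dtype=float)
    integrand = 1.0 - np.asarray(c_lf, dtype=float) / c_ox
    steps = 0.5 * (integrand[1:] + integrand[:-1]) * np.diff(v)
    return np.concatenate(([0.0], np.cumsum(steps)))


# Synthetic check: a constant ratio C_lf/C_ox = 0.6 must give a potential
# that changes by 0.4 volt per volt of bias.
C_OX = 5.73e-8                      # farads/cm^2
bias = np.linspace(2.0, -4.0, 61)   # hypothetical sweep, volts
c_lf = np.full_like(bias, 0.6 * C_OX)

psi_rel = surface_potential_within_constant(bias, c_lf, C_OX)
print(psi_rel[-1])
```

The result is the surface potential relative to its value at the first bias point, so the additive constant still has to be found separately.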

It is found that in samples oxidized in steam as described in Section
II, 50 Hz is a low enough frequency to measure C_lf vs v_a and use (52)
with negligible error. This is seen by the observation that the loss
tangent is negligibly small in the bias range investigated here below
about 80 Hz so that dispersion in the capacitance-voltage characteris-
tic virtually disappears in this frequency range.

Another important consideration is that (52) will not give an 
accurate result if there are any gross nonuniformities. Gross nonuni- 
formities occur when there are macroscopic areas under the field plate 
in which the charge density is significantly different from adjacent 
areas. Whether or not there are such nonuniformities present in a 
sample can be determined by evaluating (52) using C_lf measured
from strong accumulation to strong inversion. The resulting value of
ψ_s will be slightly less than the bandgap,* if there are no gross nonuni-
formities and considerably higher if there are.

The random distribution of built-in charges and charged interface 
states does not affect the accuracy of (52). Because of the statistical 
fluctuations of surface potential caused by this random charge distri- 
bution, bias is really found as a function of mean surface potential. 


6.3.2 Determination of Additive Constant 


The additive constant A in (52) and the doping density are calculated
next. For this purpose, C_D must be found as a function of bias in the
depletion region. Then C_D^-2 is plotted as a function of ψ_so - kT/q where

$$\psi_{so} = \int (1 - C_{lf}/C_{ox})\, dv_a$$

is the integral in (52).

G_p/ω from (47) peaks at bias values within the depletion region at
frequencies above a few hundred Hz. C_D is found from admittance
measured at these frequencies using (48) to get first C_p. Then from (40)

$$C_p = C_D + C_s(\omega, \bar{u}_s) \tag{53}$$

where C_s(ω, ū_s) is the equivalent capacitance of the distributed network
representing the interface states in Fig. 8(a) at the frequency and bias
corresponding to (G_p/ω)_max of the measured curve obtained using (47).
It is shown in Appendix C that C_s(ω, ū_s) = 2(G_p/ω)_max where (G_p/ω)_max
is the measured peak. Then, using (53) we get

$$C_D = C_p - 2(G_p/\omega)_{max}. \tag{54}$$


In the depletion region except near flat bands, Cp can be written 
in the following form using the approximate relation between Cp 
and @, *° 


$$C_D^{-2} = 2(\bar{u}_s - 1)/(q\,\epsilon_{si} N_A \beta), \tag{55}$$

where ε_si is the dielectric permittivity of silicon and β = q/kT. Thus,
a plot of C_D^-2 measured far from flat bands in the depletion region vs
ψ_so - kT/q measured in the same region will yield a straight line as
shown in Fig. 12. The intercept of this line with the abscissa when it is
extrapolated to C_D^-2 = 0 gives A. From this, bias is found as a function
of surface potential. Finally, the slope of the line in Fig. 12 gives the
doping density from (55).
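
The straight-line analysis can be sketched as a least-squares fit. The example assumes the depletion relation in the form C_D^-2 = 2(ψ_s - kT/q)/(q ε_si N) per unit area with ψ_s = ψ_so + A, so that the abscissa intercept of the fitted line is at ψ_so - kT/q = -A under this sign convention. The permittivity value, the function name, and the synthetic data are assumptions made for illustration.

```python
import numpy as np

Q = 1.602e-19              # electronic charge, coulombs
EPS_SI = 11.7 * 8.854e-14  # silicon permittivity, farads/cm (assumed value)
KT_Q = 0.02585             # thermal voltage at 300 K, volts


def fit_depletion_line(psi_so, c_d):
    """Fit C_D^-2 vs (psi_so - kT/q); return (additive constant, doping density).

    psi_so : surface potential to within the additive constant, volts
    c_d    : depletion-layer capacitance per unit area, farads/cm^2
    """
    x = np.asarray(psi_so, dtype=float) - KT_Q
    y = 1.0 / np.asarray(c_d, dtype=float) ** 2
    slope, intercept = np.polyfit(x, y, 1)
    x_intercept = -intercept / slope
    additive_constant = -x_intercept          # psi_s = psi_so + A
    doping = 2.0 / (Q * EPS_SI * slope)       # cm^-3
    return additive_constant, doping


# Synthetic data generated from the same relation (hypothetical A and doping).
A_TRUE = 0.05
N_TRUE = 1.15e16
psi_so = np.linspace(0.10, 0.30, 11)
c_d = np.sqrt(Q * EPS_SI * N_TRUE / (2.0 * (psi_so + A_TRUE - KT_Q)))

a_fit, n_fit = fit_depletion_line(psi_so, c_d)
print(a_fit, n_fit)
```

On measured data only points far from flat bands should be included in the fit, as the text notes, since the linear relation is an approximation that fails near flat bands.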


* This is particularly the case for samples having high doping density such as 
those used in this work. 
6.4 N_ss and τ_m in Depletion


Equation (39) for G_p/ω from the statistical model is found to fit
the measured G_p/ω vs frequency curves accurately at each bias (see
Section VII). To prove this, (39) is numerically integrated on an IBM
7094 computer using the trapezoidal rule.* ū_s corresponding to the
particular bias at which G_p/ω vs frequency is measured is obtained from
the relation between bias and surface potential just found. The char-
acteristic area α is the only parameter which is varied to obtain the
best fit to the measured curve. N_ss can be obtained independently.


Fig. 12 - 1/C_D^2 vs ψ_so - kT/q where C_D in farads is obtained from measure-
ments in the depletion region of an n-type sample (N_D = 1.15 × 10^16 cm^-3) and
ψ_so from the integral in (52). Field plate diameter is 3.8 × 10^-2 cm. Ordinate:
(1/C_D)^2; abscissa: ψ_so - kT/q in volts, from 0 to 0.36.


The parameter α is not arbitrary. It is found that α^(1/2) has about twice
the value and the same bias dependence as the silicon depletion layer
width (see Fig. 22).

Fig. 13(a) shows G_p/ω found by integrating (39) plotted as a function
of ωτ_m with N_ss arbitrarily chosen as 1 × 10^11 cm^-2 eV^-1 and α at a
bias midway between flat bands and mid-gap. For comparison, Fig.
13(b) is a plot of (17) and Fig. 13(c) is a plot of (20) vs ωτ_m for the
same value of N_ss. From Fig. 13(a), it is seen that the maximum value
of G_p/ω from (39) occurs when ωτ_m = 2.5. This was found to hold
at all values of bias measured. The condition ωτ_m = 2.5 is used to
find τ_m from the frequency at which the measured G_p/ω vs frequency
curve goes through a maximum at each bias. The τ_m found this way
corresponds to the mean surface potential ū_s at each bias. The condition
ωτ_m = 2.5 can also be used to get τ_m when G_p/ω is measured as a function
of bias with frequency as parameter. In this case, ω is given by the
fixed frequency but the τ_m obtained now corresponds to the bias at
peak G_p/ω.


* To do this, limits of integration in (39) must be finite. Because the fluctuations
of surface charge are small, integrating from u_s = -2u_B to u_s = 2u_B will give
a good approximation to the integral from -∞ to ∞.


Fig. 13 - Calculated G_p/ω (farads/cm^2) vs log ωτ_m. (a) Plot of (39), the
statistical model, incorporating time constant broadening from statistical
fluctuations of surface potential as well as continuum of states.
α = 2.5 × 10^-10 cm^2 and impurity density = 2.1 × 10^16 cm^-3 are
used. (b) Plot of (20) calculated by Lehovec in Ref. (19) for a continuum of
constant density and capture cross section. The same interface state density
arbitrarily chosen as 1 × 10^11 cm^-2 eV^-1 is used in calculating all the curves.

Interface state density vs bias is found by numerically integrating
(39) with ωτ_m = 2.5, equating the result to the measured peak G_p/ω
value, and solving for N_ss at each bias. Imposing the condition
ωτ_m = 2.5 in (39) gives

$$\left(\frac{G_p}{\omega}\right)_{max} = \frac{q N_{ss}}{2}\,[2\pi(\sigma_s^2 + \sigma_B^2)]^{-1/2}\, I_{max} \tag{56}$$

and

$$I_{max} = \int_{-\infty}^{\infty} \exp[-(z + y_m)]\, \ln[1 + \exp(2 y_m)]\, du_s, \tag{57}$$

where

$$y_m = u_s - \bar{u}_s + \ln 2.5 \quad \text{and} \quad z = (u_s - \bar{u}_s)^2 / 2(\sigma_s^2 + \sigma_B^2).$$


Equations (56) and (57) can be evaluated once α is known. The
simplest way to find an approximate value for α is to use the experi-
mentally established relation between α^(1/2) and W where W is the deple-
tion layer width. W can be obtained from the relation

$$W = \epsilon_{si}/C_D, \tag{58}$$

where C_D is obtained from (54).
Solving (56) for N_ss,

$$N_{ss} = \frac{2}{q}\left(\frac{G_p}{\omega}\right)_{max} [2\pi(\sigma_s^2 + \sigma_B^2)]^{1/2}\, I_{max}^{-1}. \tag{59}$$

Substituting (57) into (59) and using values of α from Fig. 22, (59)
was evaluated to find N_ss as a function of the position of the Fermi
level in the bandgap of the silicon.
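
The evaluation of N_ss from a measured peak can be sketched with a trapezoidal integration. The sketch assumes the peak relation has the form (G_p/ω)_max = (q N_ss/2)[2π(σ_s² + σ_B²)]^(-1/2) I_max with I_max = ∫ exp[-(z + y_m)] ln[1 + exp(2y_m)] du_s. It takes the total variance σ_s² + σ_B² as a given input, because the dependence of the variance on α is not reproduced here, and it truncates the integral at ±10 standard deviations, which is a choice made for this sketch. All names and sample values are hypothetical.

```python
import numpy as np

Q = 1.602e-19  # electronic charge, coulombs


def i_max(variance, n_points=4001):
    """Trapezoidal estimate of the integral I_max.

    variance : sigma_s^2 + sigma_B^2, in units of (kT/q)^2
    The integration variable x stands for u_s minus its mean value.
    """
    half_width = 10.0 * np.sqrt(variance)
    x = np.linspace(-half_width, half_width, n_points)
    y_m = x + np.log(2.5)
    z = x**2 / (2.0 * variance)
    # logaddexp(0, 2*y_m) equals ln(1 + exp(2*y_m)) without overflow.
    integrand = np.exp(-(z + y_m)) * np.logaddexp(0.0, 2.0 * y_m)
    return float(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(x)))


def peak_gp_over_omega(n_ss, variance):
    """Peak G_p/omega (farads/cm^2) for a given N_ss (cm^-2 eV^-1)."""
    return 0.5 * Q * n_ss * i_max(variance) / np.sqrt(2.0 * np.pi * variance)


def n_ss_from_peak(gp_over_omega_max, variance):
    """Invert the peak relation to get N_ss from the measured peak."""
    return (2.0 * gp_over_omega_max * np.sqrt(2.0 * np.pi * variance)
            / (Q * i_max(variance)))


# Round trip with hypothetical inputs.
peak = peak_gp_over_omega(1.0e11, 4.0)
n_ss = n_ss_from_peak(peak, 4.0)
print(peak, n_ss)
```

In the limit of vanishing variance the normalized integral tends to (1/2.5) ln(1 + 2.5²), which is the continuum-of-states expression evaluated at ωτ_m = 2.5. This gives a convenient sanity check on the numerical integration.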


6.5 N_ss and τ_m in Weak Inversion


This is a particularly simple case because G_p/ω vs frequency is char-
acterized by a single time constant at each bias. G_p/ω vs frequency
or bias curves are obtained using (47) as before. Fig. 6 and (17) apply
so that the condition for maximum G_p/ω is ωτ_m = 1 and the value
of (G_p/ω)_max is C_s/2.


6.6 Capture Probabilities and Cross Sections 


Log τ_m is plotted as a function of u_B - ū_s from measurements in
the depletion region to see if the data fit (22). It is found this way
that the measurements fit (22) quite well for both holes and electrons
(see Section VII). The fact that the experimental data fit (22) means
that c_p and c_n are independent of bandgap energy. Therefore, c_p and
c_n are not affected by statistical fluctuations of surface potential and
are the true capture probabilities of the interface states. Capture
cross section is related to capture probability by the relation c_n = v̄σ_n
for electrons where v̄ is the average thermal velocity of electrons.
c_p and c_n are calculated from experimental plots of (22) and σ_p and σ_n
found using v̄ = 10^7 cm/sec for both holes and electrons.
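
The extrapolation used to obtain a capture cross section can be sketched as a straight-line fit of ln τ_m against the Fermi level position. The sketch assumes that (22) has the form τ_m = (v̄ n_i σ)^-1 exp[-(u_B - ū_s)], which matches the statement that (22) reduces to τ_m = (v̄ n_i σ)^-1 at u_B - ū_s = 0. The function name and the synthetic data are hypothetical.

```python
import numpy as np


def capture_cross_section(fermi_position, tau_m, v_thermal=1.0e7, n_i=1.55e10):
    """Fit ln(tau_m) vs (u_B - u_s) and extrapolate to zero.

    fermi_position : u_B - u_s in units of kT/q
    tau_m          : time constants in seconds
    v_thermal      : average thermal velocity, cm/sec
    n_i            : intrinsic carrier density, cm^-3
    Returns (cross section in cm^2, fitted slope).
    """
    slope, intercept = np.polyfit(fermi_position, np.log(tau_m), 1)
    tau_at_midgap = np.exp(intercept)
    sigma = 1.0 / (v_thermal * n_i * tau_at_midgap)
    return sigma, slope


# Synthetic points generated from the assumed form of (22) with a
# hypothetical cross section, to check that the fit recovers it.
SIGMA_TRUE = 1.0e-15
u = np.linspace(2.0, 10.0, 9)
tau = np.exp(-u) / (1.0e7 * 1.55e10 * SIGMA_TRUE)

sigma_fit, slope_fit = capture_cross_section(u, tau)
print(sigma_fit, slope_fit)
```

A fitted slope close to -1 indicates that the points follow the exponential dependence, and only then is the extrapolated intercept meaningful as a cross section.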

VII. EXPERIMENTAL RESULTS 

7.1 Depletion Region 


7.1.1 Steam-Grown Oxides on [111] Orientation 


Fig. 14 - Log τ_m vs u_B - ū_s showing fit of experimentally obtained points to
(22) for both holes and electrons using steam-grown oxide and [111] orientation.
Points in lower half of the gap were measured on a p-type sample. Field plate
diameter is 1.2 × 10^-1 cm, acceptor density 2.08 × 10^16 cm^-3, u_B = 14.1 and
C_ox = 5.74 × 10^-8 farads/cm^2. Points in upper half of the gap were measured on
an n-type sample. Field plate diameter 3.8 × 10^-2 cm, donor density
1.15 × 10^16 cm^-3, u_B = 13.5, and C_ox = 5.08 × 10^-8 farads/cm^2.


Fig. 14 shows log τ_m vs u_B - ū_s calculated from the measurements
as explained in Section VI. The values plotted correspond to peaks
of the G_p/ω curves obtained from measurements made at 300°K. The
condition ωτ_m = 2.5 is used to get τ_m. Because only majority carrier
transitions give rise to measured conductance, curves in the lower half
of the silicon bandgap were obtained from measurements on p-type
and in the upper half on n-type. It is seen that these curves fit (22)
so that capture cross sections are obtained simply by extrapolating the
curves to u_B - ū_s = 0 where (22) becomes τ_m = (v̄ n_i σ)^-1. This is
then solved for σ to get capture cross section. Capture cross sections
obtained this way from Fig. 14 are


o, (holes): 2.2 * 10°*° em? from curve (a) 
c, (electrons): 1.7 * 107° cm’ from curve (b) 
c, (electrons): 5.9 X 10°*° cm’ from curve (c). 


In all cases, v̄ = 10^7 cm/sec and n_i = 1.55 × 10^10 cm^-3 have been used.

Fig. 15 shows the corresponding values of N_ss as a function of the
average position of the Fermi level Z with respect to mid-gap potential 
at the silicon surface. Measurements in the lower half of the gap were 
made on p-type and in the upper half on n-type. 

Fig. 16 illustrates the temperature dependence of τ_m. Curve (a) is
calculated from measurements made at 204°K while curve (b) is just 
a replot of curve (a) from Fig. 14 for comparison. 


Fig. 15 - N_ss from (59) vs E_F - E_i measured in units of kT/q for same two
samples in Fig. 14. Flat band is 14.1 in lower half of the gap and 13.5 in upper
half. E_F is position of the Fermi level and E_i is mid-gap. Ordinate: N_ss
(× 10^11 cm^-2 eV^-1); abscissa: E_F - E_i (units of kT/q). The upper half of the
gap is measured on n-type and the lower half on p-type.

Fig. 16 - Log τ_m vs ψ_s for p-type sample of Fig. 14: (a) measured at 204°K and
(b) at 300°K. (b) Replot of (a) in Fig. 14. Curves show that experimentally obtained
points satisfy temperature dependence of (22). (a) is the only one in this experiment
measured at a temperature other than 300°K.


7.1.2 Steam-Grown Oxides on [100] Orientation 


Fig. 17 shows log τ_m vs u_B - ū_s for the [100] orientation calculated
from measurements made at 300°K. Capture cross sections are 


og, (holes): 2.0 X 10°*° cm? from curve (a) 
a, (holes): 4.0 X 107*° cm? from curve (b) 
o, (electrons): 1.2 X 10°*° cm? from curve (c). 


Fig. 18 shows the corresponding values of N_ss in a plot similar to
Fig. 15. 
Fig. 17 - Log τ_m vs u_B - ū_s, showing fit of experimentally obtained points to
(22) for both holes and electrons using steam-grown oxide and [100] orientation. 
Points in lower half of gap were measured on a p-type sample. Field plate diame- 
ter is 1.37 X 107 cm, acceptor density 1.19 X 10° cm™, wz = 13.5, and Coz = 4.94 X 
10“farads/cm*. Points in upper half of gap were measured on an n-type sample. 
Field plate diameter is 1.43 x 107 cm, donor density 144 x 10° cm™, us = 18.7, 
and Cor = 4.24 X 10“farads/cm’. 


7.1.3 [111] Orientation, Partly-Dried Oxides 
Fig. 19 shows log τ_m vs u_B - ū_s for oxides initially grown in steam
and partially dried by heating in dry N_2 as described in Section III.
Only n-type was analyzed in detail, so that we have


c, (electrons): 4.6 X 10°*° cm’ from Fig. 19. 


Curve (a) in Fig. 20 shows the corresponding values of N_ss. Curve (b)
in Fig. 20 is a replot from Fig. 15 for comparison. 
Fig. 18 - N_ss from (59) vs E_F - E_i for same two samples in Fig. 17. Flat band
is 13.5 in lower half of gap and 13.7 in upper half.

Fig. 19 - Log τ_m vs u_B - ū_s after partial drying of n-type sample in Fig. 4.
Measured points still fit (22). Donor density is 1.4 × 10^16 cm^-3, u_B = 13.7, and
C_ox = 5.96 × 10^-8 farads/cm^2.

Fig. 20 - N_ss from (59) vs E_F - E_i. (a) n-type sample in Fig. 19 and (b) replot
of n-type sample from Fig. 15 for comparison. For (a), flat band is 13.7 and for
(b), flat band is 13.5.


7.1.4 Range of N_ss


As seen from Figs. 15 and 18, N_ss in steam-grown oxides is in the
10^11 cm^-2 eV^-1 range. Partly drying increases N_ss as seen from Fig. 20.
Complete drying or growing the oxide in dry oxygen at 1000°C results
in N_ss values in the 10^12 cm^-2 eV^-1 range or about an order of mag-
nitude higher than for steam-grown oxides.

N,, can be reduced below the values for steam-grown oxides by an 
order of magnitude by a method described by Balk.” First, an aluminum 
film several thousand Å thick is evaporated over the entire surface
of a steam-grown oxide. The sample is then annealed in H_2 or N_2 at
350°C for 1/2 hour. After annealing, field plates are produced using
photoresist techniques and chemically etching off the unwanted alum-
inum. This annealing process is found to reduce N_ss from the
10^11 cm^-2 eV^-1 range characteristic of steam-grown oxides to the
10^10 cm^-2 eV^-1 range. Thus, N_ss can be varied two orders of magnitude
from the 10^12 cm^-2 eV^-1 range characteristic of dry oxides to the
10^10 cm^-2 eV^-1 range. Capture cross section appears to remain ap-
proximately the same throughout.
7.1.5 Fit to Statistical Model


The circles in Fig. 21 represent points calculated from measurements
on the [111] p-type sample with a steam-grown oxide in Figs. 14 and 15.
These points were measured with frequency as variable and bias fixed
at -1.9 volts which corresponds to u_B - ū_s = 6.04. The best fit to
these points calculated from (39) is the solid curve in Fig. 21. This
fit was obtained by substituting (59) for N_ss in (39) and using (G_p/ω)_max
from the data. The value of α required for this fit was found to be
2.5 × 10^-10 cm^2.

Fig. 22 shows α^(1/2) vs space-charge width W. The values of α are ob-
tained by finding the best fit of (39) to G_p/ω vs frequency curves taken
at different bias settings on the same sample. Space-charge width is
calculated from the depletion layer capacitance obtained as described
in Section VI from the measured capacitance and equivalent parallel
conductance.


7.2 Weak Inversion 


Fig. 21 - G_p'/ω vs frequency showing how experimental points fit (39). An α
of 2.5 × 10^-10 cm^2 is used in (39) to get this fit. G_p' is equivalent parallel con-
ductance in mhos derived from the measurements. Experimental points are
measured at a bias of -1.9 volts (ū_s = 8.1) in depletion region on the p-type
sample of Fig. 14.

Fig. 22 - α^(1/2) vs space-charge width (× 10^-5 cm) where α is obtained by
finding the best fit of (39) to experimental points taken on p-type sample in
Fig. 14. α^(1/2) can have any value within vertical bars through each point. Bars
indicate precision with which (39) can be fitted to measured G_p/ω vs frequency
points. No reason why the point at the largest W does not lie on the straight
line is known.


The G_p/ω curves obtained after correcting for C_ox will peak in the
weak inversion range at signal frequencies below about 1 kHz. Curves
of capacitance and equivalent parallel conductance vs field plate bias
measured at 50 Hz on a [111], n-type sample with steam grown oxide 
are shown in Fig. 23. An n-type sample was used to avoid the masking 
effects of lateral ac current flow.’® The capacitance peak below mid-gap 
persists down to 1.6 Hz, the lowest frequency used, and disappears 
at frequencies above 500 Hz. Above 500 Hz, the capacitance curve 
becomes the usual high-frequency curve.’ The low-frequency capaci- 
tance dispersion is shown in Fig. 24. 

G_p/ω vs frequency curves in this range can be fitted with a single
time constant as shown in Fig. 25. The circles represent points calculated
from measurements on an n-type sample with steam-grown oxide on
the [111] orientation. These points were measured with frequency as
variable and bias fixed at -3.3 volts which corresponds to
u_B - ū_s = -6.3 in weak inversion. The best fit to these points calculated
from (17) is the solid curve in Fig. 25. The values of 7 and C, used 
in (17) to obtain this fit are 1.7 X 10°° seconds and 5.14 X 107° 
farads/cm”, respectively. . 

Fig. 26 is a plot of log τ_m vs u_B - ū_s. These values are calculated
from measurements in the weak inversion region using ωτ_m = 1 to
get τ_m. Fig. 27 is a plot of the corresponding values of C_s as a function
of the position of the Fermi level with respect to mid-gap potential
at the silicon surface. Values of C_s are obtained from the peak of the
G_p/ω curve using ωτ_m = 1 and (17). Values of N_ss using N_ss = C_s/q
are given on the right-hand ordinate axis of Fig. 27. Although these
values are not exact, they give the correct order of magnitude of N_ss.

Fig. 28 shows a low-temperature drying experiment similar to those 
in Figs. 3 and 4 except the curves in Fig. 28 were measured at 50 Hz. 
Curves (a) and (b) of Fig. 28 are plots of capacitance and equivalent 
parallel conductance vs bias for a steam-grown oxide on n-type silicon. 
Curves (c) and (d) are the capacitance and equivalent parallel conduc- 
tance vs bias after heating in dry nitrogen at 350°C for 17 hours. 

Fig. 29 is a plot of capacitance and equivalent parallel conductance
vs bias measured at 50 Hz on a sample having N_ss in the 10^10 cm^-2
eV^-1 range. The sample is n-type oriented in the [100] direction and
the low value of N_ss is produced by the annealing process described
previously.
Fig. 23 - Capacitance and equivalent parallel conductance measured at 50 Hz
vs field plate bias. Sample is the n-type one in Fig. 14. Curves show capacitance
and equivalent parallel conductance in the weak inversion region.


VIII. DISCUSSION


The discussion is divided into three parts. The first part deals with 
the depletion region and the second with the weak inversion region. 
The third part discusses the limitations of the conductance technique. 


8.1 Depletion Region 


8.1.1 Steam-Grown Oxides on [111] Orientation 


It is seen from Fig. 14 that points calculated from the measurements
fit (22) quite well for both n- and p-type samples. Time constant de-
creases by a factor of 1/e for every kT/q increase in u_B - ū_s. The
exponential dependence of time constant on surface potential shown
in Fig. 14 means that time constant is inversely proportional to majority
carrier density at the silicon surface as seen from (21) and (22). This
Fig. 24 - Capacitance vs field plate bias with frequency as parameter measured
with apparatus described in Ref. 28, on n-type sample of Fig. 14. Curves show
capacitance dispersion in weak inversion.
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Fig. 25 — G,’/w vs frequency showing how experimental points fit (17). C, = 
5.14 X 10-farads/em? and 7 = 1.7 X 10° seconds are used in (17) to get this fit. 
G>’ is equivalent parallel conductance in mhos derived from measurements. Ex- 
perimental points are measured at a bias of —3.3 volts (%. = 19.8) in the weak 
inversion region on the n-type sample of Fig. 14. 


is the experimental evidence that majority carrier transitions make the 
dominant contribution to measured equivalent parallel conductance in 
the depletion region. 

The exponential dependence of time constant on surface potential
is important evidence, in addition to the drying experiments of Figs. 3
and 4, that the dominant loss is due to transitions to and from interface
states. This follows from the fact that no other relaxation process in
the system has a time constant which depends exponentially on surface
potential. To see this, we shall consider generation-recombination
through impurity levels in the silicon near mid-gap, majority carrier
transitions to and from impurity levels in the silicon near the Fermi
level, and the transit time of majority carriers across the silicon
space-charge region. Bulk generation-recombination is insignificant because
the Fermi level does not cross impurity levels near mid-gap for band
bending in depletion, as seen in Fig. 5(b). Therefore,
generation-recombination is taken to be zero in Figs. 6 and 8. The Fermi level can
still cross any impurity levels located near it [see Fig. 5(b)]. Majority
carrier transitions to and from levels at this crossover point will
contribute to the loss. The distance of the crossover point from the
interface will vary with bias. However, the crossover point is always at
the same energy with respect to mid-gap, so that majority carrier
density there will be constant and independent of bias. Therefore, the
time constant for this process will be independent of bias. As the time
constant is not observed to be bias independent in depletion in the
samples measured, majority carrier transitions to and from levels in
the silicon bulk must make a negligible contribution to the loss
compared to majority carrier transitions to and from interface states.

Fig. 26 - Log $\tau_m$ vs $u_B - \bar{u}_s$ for n-type sample in Fig. 14 in weak inversion
region. Curve shows that equivalent parallel conductance in weak inversion is
dependent on majority carrier density at the silicon surface.

Loss due to the transit of majority carriers across the space-charge
layer is negligible.11,12 It can be shown using the relations given in
Refs. 11 and 12 that the value of space-charge resistance for majority
carriers when the silicon surface is intrinsic is four orders of magnitude
less than majority carrier capture resistance in the samples measured.

The fact that curves (a), (b), and (c) in Fig. 14 can be fitted so well
by straight lines implies that capture cross section in these regions is
to a good approximation independent of energy in the bandgap. This
independence is an experimental fact for which no satisfactory
explanation exists at present. Because capture cross sections are independent
of energy, they will be unaffected by statistical fluctuations of surface
potential. The values of capture cross section obtained from the linear
portions of Fig. 14 are therefore truly characteristic of the interface
states present. Curves (b) and (c) in Fig. 14, measured on n-type, are
characterized by two types of interface states having different capture
cross sections with an abrupt transition between them. The reason for
this is not understood.

Fig. 15 shows that $N_{ss}$ is slowly varying with energy. It is reasonable
to conclude from this, and from the observation that the time constant
varies monotonically with surface potential over the same portion of
the band gap, that there is a continuum of interface states rather
than a discrete level. It is also seen from Fig. 15 that $N_{ss}$ is in the
$10^{11}$ cm$^{-2}$-eV$^{-1}$ range. This means that the capturing centers are spaced
too far apart in the plane of the interface for the wavefunction of an
electron in one center to overlap a neighboring center. Transitions from
one center to another are therefore highly improbable, so that it may
be assumed that we do not have a band but just a continuum of levels
closely spaced in energy. Transitions will only occur between the
majority carrier band and levels within a few kT/q of the Fermi level.
Cascading transitions from center to center, which could occur in a
band, would probably have a different dependence of time constant on
surface potential than shown in Fig. 14. Therefore, cascading transitions
were not considered in the theory developed in Section IV.

Fig. 27 - $C_s$ and estimated $N_{ss}$ vs $E_F - E_i$ for n-type sample in Fig. 14 in the
weak inversion region. Estimated $N_{ss} = C_s/q$. $C_s$ increases monotonically with
$E_F - E_i$.


[Figure: capacitance in pF and conductance in n-mhos vs field plate bias in volts.]

Fig. 28 - (a) Capacitance and (b) equivalent parallel conductance measured
vs field plate bias before drying; (c) capacitance and (d) equivalent parallel
conductance vs field plate bias of same capacitor after heating in dry nitrogen
for 17 hours at 350°C. This is the same sample as Fig. 4, only measured at 50 Hz.

[Figure: capacitance in pF and equivalent parallel conductance in n-mhos vs field plate bias in volts.]

Fig. 29 - Capacitance and equivalent parallel conductance measured at 50 Hz
vs field plate bias. Field plate diameter is $3.8 \times 10^{-2}$ cm, silicon resistivity 1 ohm-cm,
n-type, [100] oriented, and $C_{ox} = 2.8 \times 10^{-8}$ farads/cm$^2$. Oxide is grown in
steam then He annealed as described in text to produce low $N_{ss}$.


The integration of (4) and (26) was performed for the case where
$N_{ss}$ and capture cross section do not vary appreciably over an energy
interval of several kT/q. Figs. 14 and 15 show the experimental
justification for this.

In Fig. 16, log $\tau_m$ is plotted as a function of $\psi_s$ rather than $u_B - \bar{u}_s$
to illustrate more clearly the temperature dependence. An alternate
and equivalent form of (22) which follows from (21) for p-type is

$$\tau_m = \frac{1}{\bar{v}\,\sigma_p N_A} \exp(q\psi_s/kT). \tag{60}$$

Curve (a) in Fig. 16 fits (60) quite well with a slope q/kT where T =
204°K. Curve (b) fits with the slope for 300°K. Thus, the temperature
dependence expected from (60) is observed. This agrees with Shockley-
Read theory’® which is used in Section IV. Capture cross section can
be obtained from (60) by extrapolating the curves in Fig. 16 to $\psi_s = 0$.
Doing this, we get $\sigma_p = 8.8 \times 10^{-17}$ cm$^2$ from curve (a) and $\sigma_p =
2.1 \times 10^{-16}$ cm$^2$ from curve (b). The capture cross section from curve
(b) is of course in agreement with $\sigma_p$ from curve (a) in Fig. 14. However,
it is 2.4 times larger than $\sigma_p$ from curve (a) of Fig. 16. This is probably
due to the error in finding the value of A from (52). Normally, A can
be found only within 20 to 30 mV of its correct value by the technique
of Section V. Part of this error is due to neglecting the difference
between hole and electron effective mass. It is considered likely that in
reality capture cross section is nearly independent of temperature in this
range. The apparent variation is ascribed instead to the errors in finding
the value of A.
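
The extrapolation described above can be sketched numerically. The Python fragment below is only an illustration: it assumes the p-type form of (60), time constant = exp(q psi_s / kT) / (v_bar * sigma_p * N_A), and it uses a thermal velocity of 1e7 cm/s and an acceptor density of 2.1e16 cm^-3 as hypothetical round inputs that are not given for this sample. The two cross sections are taken as 8.8e-17 and 2.1e-16 cm^2, the exponents being inferred from the stated factor of 2.4.

```python
import math

K_OVER_Q = 8.617e-5  # Boltzmann constant divided by electron charge, in V/K


def tau_m(psi_s, sigma_p, temperature, v_bar=1.0e7, n_a=2.1e16):
    """Time constant in seconds from the assumed p-type form of (60).

    v_bar (cm/s) and n_a (cm^-3) are hypothetical illustrative inputs.
    """
    return math.exp(psi_s / (K_OVER_Q * temperature)) / (v_bar * sigma_p * n_a)


def sigma_from_intercept(tau_at_zero, v_bar=1.0e7, n_a=2.1e16):
    """Capture cross section (cm^2) from the time constant extrapolated to psi_s = 0."""
    return 1.0 / (v_bar * n_a * tau_at_zero)


def slope_per_volt(temperature):
    """Slope q/kT of ln(tau_m) against psi_s, in 1/V."""
    return 1.0 / (K_OVER_Q * temperature)


sigma_204K = 8.8e-17  # curve (a) of Fig. 16, cm^2
sigma_300K = 2.1e-16  # curve (b) of Fig. 16, cm^2

ratio = sigma_300K / sigma_204K
recovered = sigma_from_intercept(tau_m(0.0, sigma_300K, 300.0))
slope_ratio = slope_per_volt(204.0) / slope_per_volt(300.0)

print(f"ratio of cross sections : {ratio:.2f}")
print(f"recovered sigma at 300 K: {recovered:.2e} cm^2")
print(f"slope(204 K)/slope(300 K): {slope_ratio:.3f}")
```

The slope ratio depends only on the two temperatures, which is why the temperature dependence can be read from Fig. 16 independently of the uncertain intercept.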


8.1.2 Steam-Grown Oxides on [100] Orientation 


The same general conclusions can be reached for the [100] samples
from Figs. 17 and 18 as for the [111] samples just discussed, but there
are two differences. The first difference is that the time constant curves
in Fig. 14 for the upper half of the gap in [111] n-type are similar to
the curves for the lower half of the gap in [100] p-type in Fig. 17. Also,
the curves in the lower half of the gap in Fig. 14 for [111] p-type are
similar to the curves in the upper half of the gap in [100] n-type in
Fig. 17. The second difference is that interface state density in Fig. 15
for the lower half of the gap in [111] p-type is about twice as great
as in Fig. 18 for the lower half of the gap in [100] p-type. However,
interface state density in Fig. 15 for the upper half of the gap in [111]
n-type is about the same as in Fig. 18 for the upper half of the gap
in [100] n-type. The reason for these differences is not understood.

Built-in charge density in the [111] samples varies from sample to
sample from $5 \times 10^{11}$ cm$^{-2}$ to $9 \times 10^{11}$ cm$^{-2}$ in both n- and p-type.
This is about twice as great as in the [100] samples, confirming the
observations of Balk.”” Also, the increase of $N_{ss}$ towards the band
edges as shown in Figs. 15 and 18 is consistent with the results of Gray
and Brown.””


8.1.3 [111] Orientation, Partly-Dried Oxides 


Fig. 19 shows that interface state time constants fit (22) after partial
drying. Capture cross section for n-type has not changed appreciably,
although two types of states were not found. This may be due to the
fact that too small a range of surface potential was covered in the
measurements. Because capture cross section did not change, partial
drying does not appear to introduce new types of states, but only
increases the number of states already present.

Comparison of curve (a) to curve (b) in Fig. 20 shows that interface
state density for the partly-dried oxide is greater than for the wet
oxide. This can be seen directly from Figs. 3 and 4 for p- and n-type
by the increase in the peak of the measured conductance with drying.

Fig. 3 for p-type shows by the shift of the curves to higher negative 
bias that a net positive charge has been induced in the oxide by partial 
drying. Fig. 4 for n-type shows by the shift of the curves towards less 
negative bias that a net negative charge has been induced in the oxide 
by partial drying. No conclusions about these results will be drawn 
because it is not certain that other changes independent of interface 
state density have not occurred in these samples. 

The increase in interface state density with drying can explain the
results observed in n+p junctions in a previous paper.31 In Ref. 31,
it was found that drying an oxide heavily contaminated with alkali 
metal ions drastically reduced the current flowing in an n channel 
along the surface of the p-type material. The increased interface state 
density caused by drying results in more channel electrons being 
captured by interface states. This markedly reduces field-effect mobility 
which in turn results in an increase in the sheet resistance of the inver- 
sion layer. Although the inversion layer may be terminated by a defect 
at the edge of the wafer which shorts it to the p substrate, the current 
flow will be reduced to a much lower level than before. 


8.1.4 Range of $N_{ss}$


The range of $N_{ss}$ from $10^{12}$ cm$^{-2}$-eV$^{-1}$ for very dry oxides through
$10^{11}$ cm$^{-2}$-eV$^{-1}$ for wet oxides to $10^{10}$ cm$^{-2}$-eV$^{-1}$ for aluminum clad
oxides annealed in H$_2$ or N$_2$ is an empirical observation. It is not
definitely known what these centers are. Therefore, an explanation can
only be speculative. However, these results are qualitatively similar
to previous work on "bare" surfaces which had at most a few
monolayers of oxide.*” Exposure to water vapor always reduced $N_{ss}$ by a
few orders of magnitude.” For the thick oxides studied in this work,
water was most likely present in the form of OH groups, which have a
permanent dipole moment. An interesting model to explain the
annihilation of fast states on "bare" surfaces has been proposed by Rzhanov.**
Assuming that each interface state also has a permanent dipole moment,
the lowest energy configuration will occur at the interface when the
dipole moment of an H$_2$O or OH group is aligned antiparallel with the
dipole moment of the nearest interface state. This could reduce the
capture probability of the interface state™* to such an extent that there
is no measurable loss due to transitions involving it. The only loss
measured would be due to transitions to and from the remaining
unpaired states. For thick oxides, this fits the observed decrease in $N_{ss}$
with increasing wetness and the observation that measured capture
cross section remains unchanged. This model is, however, only one of
several capable of explaining the results.


8.1.5 Statistical Model 


The time constant dispersion actually observed is much broader than 
expected for a continuum. This can be seen by comparing Fig. 21 to 
the curves in Fig. 13. Three mechanisms which could cause this broaden- 
ing are considered. They are: 


(i) Tunnelling of majority carriers to states randomly distributed
into the oxide.”
(ii) Fluctuations of capture cross section over the plane of the
interface.
(iii) Random fluctuations of surface potential over the plane of the
interface.


The tunnelling mechanism is unlikely to be the dominant cause of
the time constant dispersion, for the following reasons. The range of
frequencies used in the measurements is from 50 Hz to 500 kHz. Thus,
only those interface states having time constants shorter than several
milliseconds will be measured. The time constant distribution for
tunnelling would be’


$$\tau = (\bar{v}\,\sigma_0\,n_{s0})^{-1} \exp(2K_0\xi), \tag{61}$$


where $\sigma_0$ is the true electron capture cross section of the states, $1/2K_0$
is the tunnelling decay constant, and $\xi$ is the distance into the oxide
measured from the interface. For any reasonable value of $1/2K_0$, such
as 1 Å given in Ref. 2, and reasonable values of $\sigma_0$, such as 10°** cm$^2$
to 107** cm$^2$, it can be seen from (61) that states located more than
a few Å into the oxide would have time constants much too long to
be measured in the frequency range between 50 Hz and 500 kHz.
Hysteresis in the capacitance vs bias characteristic, which can be
observed by slowly varying bias from a low to a high value and then
back, is negligibly small. Hysteresis would be observed when there is
an appreciable density of states deep in the oxide having time constants
longer than the time it takes to vary the bias, because the charging
and discharging of these states would lag behind the bias variation.”
The fact that no hysteresis is observed means that the density of states
in the oxide having time constants of the order of minutes is negligible
in the samples measured. Finally, it is shown in Appendix D that for
any reasonable value of $K_0$, the shape of the $G_p/\omega$ vs log $\omega$ curve
calculated using (20) for the continuum of states and (61) for the tunnelling
time constant distribution differs markedly from the observed $G_p/\omega$ vs
log $\omega$ curve.
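
The depth argument above can be made concrete with a short sketch. It assumes only the exponential factor of (61), with a decay length 1/2K0 of 1 Å as quoted from Ref. 2. The time constant of a state right at the interface (`tau_interface`, set to one microsecond here) is a hypothetical value chosen for illustration, since the prefactor depends on quantities not fixed in this section.

```python
import math


def tunnel_time_constant(depth_angstrom, tau_interface, decay_length_angstrom=1.0):
    """Time constant (s) of a state at the given depth in the oxide.

    Uses tau = tau_interface * exp(depth / decay_length), i.e. the exponential
    factor exp(2*K0*depth) with decay_length = 1/(2*K0).
    """
    return tau_interface * math.exp(depth_angstrom / decay_length_angstrom)


def deepest_measurable(tau_max, tau_interface, decay_length_angstrom=1.0):
    """Depth (angstrom) at which the time constant reaches tau_max."""
    return decay_length_angstrom * math.log(tau_max / tau_interface)


tau_interface = 1.0e-6                    # hypothetical interface time constant, s
tau_max = 1.0 / (2.0 * math.pi * 50.0)    # about 3 ms, set by the 50 Hz lower limit

depth_limit = deepest_measurable(tau_max, tau_interface)
tau_at_20A = tunnel_time_constant(20.0, tau_interface)

print(f"deepest state seen at 50 Hz: {depth_limit:.1f} angstrom")
print(f"time constant 20 angstrom deep: {tau_at_20A:.0f} s")
```

With these inputs only states within roughly 8 Å respond in the measured band, while a state 20 Å deep would respond over minutes and would show up as hysteresis rather than as conductance.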

Although capture cross section is observed to be independent of
energy over large portions of the band gap from Fig. 14, capture cross
section could depend upon the location of the state in the plane of
the interface. The assumption that capture cross section fluctuates over
this plane does not give a broad enough time constant dispersion to
fit the observed $G_p/\omega$ vs frequency curves.

Random fluctuations of surface potential over the plane of the
interface is the model which fits best. One can see from (22) that small
fluctuations of $u_s$ will cause large fluctuations in $\tau_m$. This broadens
the time constant dispersion sufficiently to fit the experimental curves.
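
A rough numerical illustration of this broadening follows. It is a simplified sketch, not the full expression (39): it assumes the familiar continuum loss shape ln(1 + x^2)/(2x) with x the product of angular frequency and time constant, and it averages that shape over a Gaussian spread of surface potential, using the exponential dependence of time constant on surface potential. The standard deviation of 2.5 (in units of kT/q) is a hypothetical input.

```python
import math


def continuum_loss(x):
    """Normalised loss G_p/(omega*q*N_ss) of a continuum, with x = omega*tau."""
    return math.log1p(x * x) / (2.0 * x)


def broadened_loss(x, sigma, points=801, span=6.0):
    """Continuum loss averaged over a Gaussian spread of surface potential.

    sigma is the standard deviation in units of kT/q; a shift eta in surface
    potential multiplies the time constant by exp(eta).
    """
    if sigma == 0.0:
        return continuum_loss(x)
    total = 0.0
    norm = 0.0
    for i in range(points):
        eta = -span * sigma + 2.0 * span * sigma * i / (points - 1)
        weight = math.exp(-eta * eta / (2.0 * sigma * sigma))
        total += weight * continuum_loss(x * math.exp(eta))
        norm += weight
    return total / norm


def peak_and_width(sigma):
    """Peak height and full width at half maximum (in decades of frequency)."""
    xs = [10.0 ** (-4.0 + 8.0 * i / 400.0) for i in range(401)]
    ys = [broadened_loss(x, sigma) for x in xs]
    peak = max(ys)
    above = [x for x, y in zip(xs, ys) if y >= 0.5 * peak]
    return peak, math.log10(above[-1] / above[0])


peak_0, width_0 = peak_and_width(0.0)
peak_b, width_b = peak_and_width(2.5)

print(f"no fluctuations : peak {peak_0:.3f}, width {width_0:.2f} decades")
print(f"sigma = 2.5     : peak {peak_b:.3f}, width {width_b:.2f} decades")
```

The broadened curve has a lower peak and spans more decades of frequency, which is the qualitative signature compared against the measured curves.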

Random fluctuations of surface potential can be caused by: 


(i) A random distribution of built-in charges and charged interface
states over the plane of the interface.
(ii) A random distribution of ionized impurities in the silicon.
(iii) Random fluctuations of oxide thickness over the plane of the
interface.*


Of these, the first two are the basis for deriving (39) in Section IV.
The last was excluded because no dependence of time constant
dispersion on oxide thickness was observed between 500 and 2000 Å in the
samples measured. Fig. 21 shows the fit of (39) to experimental values
of $G_p/\omega$ measured in the frequency range between 50 Hz and 500 kHz
at a mean surface potential of 8.1; (39) accurately fits the experimental
points. The fit of (39) to the experimental $G_p/\omega$ vs
frequency points is equally accurate for all the other values of mean
surface potential $\bar{u}_s$ measured in depletion. The only independent
parameter in (39) varied to obtain these fits is the characteristic area $a$.
By fitting (39) to the experimental $G_p/\omega$ vs frequency points at each
value of $\bar{u}_s$ in this way, the values of $a$ obtained are found to be related
to space-charge width $W$. This relation is shown in Fig. 22, where the
length of the side of each square characteristic area, $a^{1/2}$, is plotted against
$W$. Fig. 22 shows that $a^{1/2}$ increases linearly with $W$ and has a larger
magnitude. To understand this relationship, consider a positive built-in
charge or charged interface state located at a point in the plane of
the interface. This charge is sandwiched between the plane of the field
plate and the plane at the edge of the space-charge region a distance
$W$ away. An image charge is induced in each of these planes. The
image charge in each plane induces another in the opposite plane, and
so on. Thus, the charge at the interface induces an infinite number
of image charges in the field plate and space-charge edge planes. Let
us now add a number of charges in the neighborhood of this charge
in the plane of the interface and calculate the charge distribution
induced by all the charges and their images in the plane at the edge
of the space-charge region. The largest spacing between charges which
results in a constant induced charge distribution in the plane at the
edge of the space-charge region is $a^{1/2}$. Thus, charges within a
characteristic area are indistinguishable as separate charges, which makes
surface potential over a characteristic area uniform. The characteristic
area is a measure of the granularity of the surface-charge distribution
seen at the edge of the space-charge region. A calculation based on
the image charge model shows that $a^{1/2}$ has the values in Fig. 22 and
a linear dependence on $W$. The value of $a$ can be obtained within a
factor of 2 from the image charge model, so that $a$ is not completely
arbitrary.

* This effect was pointed out to us by C. R. Crowell.35

We shall show next that the random distribution of built-in charges
and charged interface states is the dominant cause of surface potential
fluctuations in the samples measured. Rewriting (36) from Section IV
for the standard deviation of surface potential caused by the random
distribution of built-in charges and charged interface states gives

$$\sigma_s = \frac{W(\bar{u}_s)\,\beta\,(qQ/a)^{1/2}}{W(\bar{u}_s)\,C_{ox} + \epsilon_{si}}. \tag{62}$$


Curve (b) in Fig. 30 is the variance $\sigma_s^2$ calculated from (62) as a function
of $W$. $Q$ calculated from (31) and (32) with $u_s = \bar{u}_s$ is found to be
independent of bias from -1.8 volts to -2.2 volts. This bias range
corresponds to mean surface potential from 10.3 to 3.8. Assuming a
constant value of interface state density of $3 \times 10^{11}$ cm$^{-2}$-eV$^{-1}$ in the
lower half of the gap from Fig. 15, the change in total interface state
density is $3 \times 10^{11} \times (10.3 - 3.8) \times 0.026 = 5.1 \times 10^{10}$ cm$^{-2}$. This
is negligibly small compared to $Q/q = 9.8 \times 10^{11}$ cm$^{-2}$. Curve (b)
in Fig. 30 is calculated using $Q = 1.57 \times 10^{-7}$ coul/cm$^2$, $C_{ox} = 5.7 \times
10^{-8}$ farads/cm$^2$ and the values of $a$ from Fig. 22.

Fig. 30 - (a) Total variance of surface potential ($\sigma_s^2 + \sigma_B^2$) caused by random
distributions of built-in charges and charged interface states and ionized
acceptors in the silicon space-charge region. $\sigma_s$ and $\sigma_B$ are calculated from (62) and
(63), respectively. (b) Variance of surface potential $\sigma_s^2$ caused by random
distribution of built-in charges and charged interface states alone. $\sigma_s$ is calculated from
(62). (c) Variance of surface potential $\sigma_B^2$ caused by random distribution of ionized
acceptors in silicon space-charge region. $\sigma_B$ is calculated from (63). (d) Variance
of surface potential $\sigma_\xi^2$ caused by fluctuations of oxide thickness. $\sigma_\xi$ is calculated
from (64).
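
The charge bookkeeping quoted for curve (b) can be checked with a few lines of arithmetic. The sketch below takes Q = 1.57e-7 coul/cm^2 and kT/q = 0.026 V from the text; the interface state density of 3e11 per cm^2 per eV is an assumed reading of the value taken from Fig. 15, and the electron charge is the usual textbook constant.

```python
ELECTRON_CHARGE = 1.602e-19  # coulombs

q_total = 1.57e-7            # built-in plus interface state charge, coul/cm^2
kt_over_q = 0.026            # volts, as used in the text
n_ss = 3.0e11                # assumed interface state density, cm^-2 eV^-1
u_high, u_low = 10.3, 3.8    # range of mean surface potential, in units of kT/q

# Number of charges per cm^2 corresponding to the total charge Q.
charges_per_cm2 = q_total / ELECTRON_CHARGE

# Change in occupied interface states as the surface potential sweeps the range.
delta_states = n_ss * (u_high - u_low) * kt_over_q

fraction = delta_states / charges_per_cm2

print(f"Q/q            = {charges_per_cm2:.2e} cm^-2")
print(f"change in N    = {delta_states:.2e} cm^-2")
print(f"relative change = {fraction:.1%}")
```

Under these assumptions the swing in interface state charge is about five percent of Q/q, which supports treating Q as independent of bias over this range.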


For the standard deviation of surface potential caused by a random
distribution of ionized acceptors in the silicon space-charge region,
we rewrite (37) derived in paragraph B.1 of Appendix B

$$\sigma_B = \frac{q\beta\,[N_A W(\bar{u}_s)]^{1/2}\,[1 - \exp(-\bar{u}_s)]}{2\,[W(\bar{u}_s)\,C_{ox} + \epsilon_{si}]}. \tag{63}$$

Curve (c) in Fig. 30 is the variance $\sigma_B^2$ calculated from (63) as a
function of $W$. The values used are: $C_{ox} = 5.7 \times 10^{-8}$ farads/cm$^2$ and
$N_A = 2.1 \times 10^{16}$ cm$^{-3}$.

Finally, for the standard deviation of surface potential caused by
oxide thickness fluctuation, we rewrite (101), derived in paragraph B.2
of Appendix B,

$$\sigma_\xi = \frac{q N_A \beta\, W^2(\bar{u}_s)\,[1 - \exp(-\bar{u}_s)]\, d\xi}{W(\bar{u}_s)\,\epsilon_{ox} + \epsilon_{si}\,\xi}. \tag{64}$$

Curve (d) in Fig. 30 is the variance $\sigma_\xi^2$ calculated from (64) as a
function of $W$. The fluctuations of oxide thickness arise from the random
nature of the oxidation process. It is reasonable to take $d\xi$ as about
one silicon lattice constant or 5 Å. The values used in (64) to get curve
(d) in Fig. 30 are: $\xi$ = 595 Å, $N_A = 2.1 \times 10^{16}$ cm$^{-3}$, and $d\xi$ = 5 Å.

Curve (a) in Fig. 30 is $\sigma_s^2 + \sigma_B^2$ vs $W$ obtained by fitting (39) to the
experimental $G_p/\omega$ vs frequency points. Fig. 30 clearly shows that the
random distribution of built-in charges and charged interface states is
the dominant cause of surface potential fluctuations in the samples
measured. It can be easily seen that $a^{1/2}$ should vary linearly with $W$ as
follows. Over the range of surface potential measured, the experimental
$G_p/\omega$ vs frequency curves broaden very slightly with increasing surface
potential. This is shown quantitatively in curve (a) of Fig. 30. Over
the same range of surface potential, curve (c) in Fig. 30 is virtually
independent of $W$. The dominant term $\sigma_s^2$ in curve (a) is therefore
nearly independent of $W$, as shown in curve (b) of Fig. 30. For $\sigma_s$
constant, an inspection of (62) will show that for $Q$ constant and $WC_{ox} < \epsilon_{si}$,
as is the case in the samples measured, $a^{1/2}$ must vary linearly with $W$.
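
As a plausibility check on the relative sizes discussed above, the sketch below evaluates the charge-fluctuation term and the oxide-thickness term numerically. Several things here are assumptions rather than statements from the text: the charge term is taken as W*beta*sqrt(q*Q/a)/(W*C_ox + eps_si), the thickness term as q*N_A*beta*W^2*(1 - exp(-u))*d/(W*eps_ox + eps_si*t), beta is taken as q/kT with kT/q = 0.0259 V, the permittivities are textbook values (11.7 and 3.9 times the vacuum permittivity), the characteristic area is back-calculated from the mean charge counts of 140 and 246 quoted for the two values of W, and the pairing of each W with a mean surface potential is a guess.

```python
import math

ELECTRON_CHARGE = 1.602e-19        # coulombs
BETA = 1.0 / 0.0259                # q/kT in 1/V (assumed room temperature)
EPS_SI = 11.7 * 8.854e-14          # silicon permittivity, F/cm (textbook value)
EPS_OX = 3.9 * 8.854e-14           # oxide permittivity, F/cm (textbook value)

C_OX = 5.7e-8                      # oxide capacitance, F/cm^2
Q_TOTAL = 1.57e-7                  # built-in plus interface charge, coul/cm^2
N_A = 2.1e16                       # acceptor density, cm^-3
T_OX = 595e-8                      # oxide thickness, cm
D_OX = 5e-8                        # thickness fluctuation, cm


def sigma_charge(width, area):
    """Standard deviation (units of kT/q) from randomly placed surface charges."""
    return (width * BETA * math.sqrt(ELECTRON_CHARGE * Q_TOTAL / area)
            / (width * C_OX + EPS_SI))


def sigma_thickness(width, u_mean):
    """Standard deviation (units of kT/q) from oxide thickness fluctuations."""
    return (ELECTRON_CHARGE * N_A * BETA * width ** 2
            * (1.0 - math.exp(-u_mean)) * D_OX
            / (width * EPS_OX + EPS_SI * T_OX))


# (space-charge width in cm, mean charges per characteristic area, assumed u_mean)
points = [(6.9e-6, 140, 3.8), (1.07e-5, 246, 10.3)]

results = []
for width, count, u_mean in points:
    area = count * ELECTRON_CHARGE / Q_TOTAL   # characteristic area, cm^2
    results.append((sigma_charge(width, area), sigma_thickness(width, u_mean)))

for (width, _, _), (s_q, s_t) in zip(points, results):
    print(f"W = {width:.2e} cm: charge term {s_q:.2f}, thickness term {s_t:.3f}")
```

With these inputs the charge term stays near 2.5 at both widths while the thickness term is smaller by more than an order of magnitude, in line with the ordering of the curves described for Fig. 30.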

The density of built-in charges and charged interface states is in the
high $10^{11}$ cm$^{-2}$ range in the samples measured. Therefore, these charges
are about 100 Å apart on the average, which is too far for coulombic
interaction between them to be important. This makes the assumption
that they are randomly distributed a reasonable one. The mean number
of built-in charges and charged interface states $\bar{N}$ in a characteristic
area $a$ can be calculated from the relation $\bar{N} = aQ/q$ (see Section IV)
using values of $a$ obtained from Fig. 22. $\bar{N}$ is found to vary from 140
charges at $W = 6.9 \times 10^{-6}$ cm to 246 charges at $W = 1.07 \times 10^{-5}$ cm.
For these values of $\bar{N}$, there is negligible error in using the Gaussian
approximation to the Poisson distribution (28) in deriving (39) in
Section IV. The mean number of ionized acceptors calculated from the
relation $M = W^3 N_A$ varies from 26 charges at $W = 1.07 \times 10^{-5}$ cm
to 7 charges at $W = 6.9 \times 10^{-6}$ cm. For all values of $M$ except perhaps
the last, there is negligible error in using the Gaussian approximation
to the Poisson distribution (81) in paragraph B.1 of Appendix B in
deriving (39). Any error at $W = 6.9 \times 10^{-6}$ cm is not important to
the final results because the contribution of $\sigma_B^2$ to the total variance
is so small. The assumption of small fluctuations made in deriving (39)
also seems to be reasonable because of the rather good fit obtained
to the experimental points.
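
The counts quoted above follow from simple arithmetic, which the sketch below reproduces. It assumes N_A = 2.1e16 per cm^3, Q/q = 9.8e11 per cm^2, and the two space-charge widths as read here; the side length of the characteristic area is back-calculated from the quoted charge counts rather than read from Fig. 22.

```python
import math

N_A = 2.1e16                 # acceptor density, cm^-3
CHARGES_PER_CM2 = 9.8e11     # Q/q, cm^-2

widths = (1.07e-5, 6.9e-6)   # space-charge widths, cm
counts = (246, 140)          # quoted mean charges per characteristic area

# Mean number of ionized acceptors in a cube of side W.
acceptors = [w ** 3 * N_A for w in widths]

# Average spacing between surface charges, in angstrom (1 cm = 1e8 angstrom).
spacing_angstrom = 1.0e8 / math.sqrt(CHARGES_PER_CM2)

# Side of the characteristic area implied by each count, relative to W.
side_over_width = [math.sqrt(n / CHARGES_PER_CM2) / w
                   for n, w in zip(counts, widths)]

# Relative Poisson fluctuation 1/sqrt(N) for the surface-charge counts.
relative_fluctuation = [1.0 / math.sqrt(n) for n in counts]

print("acceptors per W^3      :", [round(m, 1) for m in acceptors])
print(f"mean charge spacing    : {spacing_angstrom:.0f} angstrom")
print("side length / W        :", [round(r, 2) for r in side_over_width])
print("1/sqrt(N)              :", [round(f, 3) for f in relative_fluctuation])
```

The surface-charge counts are large enough that the relative fluctuation is below ten percent, whereas a count of about 7 acceptors at the smaller width is where a Gaussian approximation starts to be questionable.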

One can decide from (62), (63), and (64) which influence will be 
dominant for samples other than the ones used here. In spite of the 
fact that the samples investigated here were dominated by random 
fluctuations of built-in charges and charged interface states, it should 
be borne in mind that the other influences may become important in 
other cases. When built-in charge and interface state density is low 
and the oxide relatively thick, then bulk doping fluctuations will 
dominate. When the oxide is thin and the impurity density is high, 
random fluctuations of oxide thickness become important. The general 
conclusion is that there will always be random fluctuation of surface
potential in semiconductor-insulator interfaces.

No data has been taken at flat bands or in the accumulation region. 
However, it is expected that the equivalent circuit in Fig. 8(a) will 
also be valid in this region with all the distributed time constants 
shifted to shorter values. 


8.2 Weak Inversion Region 


In this part, it will be shown that the loss in weak inversion can 
arise from generation-recombination either through bulk states or 
interface states and how the conductance technique can be used to 
distinguish between them. It will be shown also that the observed 
effects in this region can be explained with a continuum of states rather 
than a single level near a band edge. 

When the silicon surface becomes inverted, the Fermi level will
cross impurity levels near mid-gap, as seen from Fig. 5(c).
Generation-recombination through bulk states as well as through interface states
can contribute significantly to the measured loss. These two cases can
be distinguished by the bias dependence of the measured equivalent
parallel conductance. When generation-recombination through interface
states dominates the loss, the equivalent parallel conductance goes
through a peak as a function of bias in weak inversion because the time
constant varies inversely with majority carrier density at the surface
(see paragraph 4.6). This case is illustrated in Fig. 23, where equivalent
parallel conductance goes through a peak in weak inversion and drops
to a very low value in strong inversion.

When generation-recombination through bulk states dominates the
loss, equivalent parallel conductance does not go through a peak as
a function of bias in weak inversion. The reason for this is that the
loss due to this process is bias independent. This follows from the fact
that hole and electron densities at the crossover point [see Fig. 5(c)],
where the Fermi level crosses impurity levels located near mid-gap,
are independent of bias. This case is illustrated in Fig. 29. The slight
indication of a peak in the conductance curve means that
generation-recombination through interface states makes a slight contribution to
the loss. However, the primary effect is that of a bias independent
loss. The variation of measured conductance with bias is mainly due
to variation of capacitance with bias.

Fig. 23 is measured on a sample having $N_{ss}$ in the $10^{11}$ cm$^{-2}$-eV$^{-1}$
range while Fig. 29 is measured on a sample having $N_{ss}$ in the $10^{10}$
cm$^{-2}$-eV$^{-1}$ range. Thus, samples can be made in which one or the other
process dominates the loss. It should be noted that the conductance
technique can be used to investigate generation-recombination through
interface states or bulk states in the weak inversion region by choosing
an appropriate sample. The investigation of generation-recombination
through bulk states is beyond the scope of this work, so that most
of the samples investigated have interface state loss dominant in weak
inversion. Therefore, bulk generation-recombination is made zero in
the equivalent circuits of Fig. 9.

In general, samples having $N_{ss}$ in the $10^{11}$ cm$^{-2}$-eV$^{-1}$ range or higher
will have interface state loss dominant in the weak inversion range.
Additional evidence for this in such samples is shown in Fig. 28. The
only difference between Fig. 28 and Fig. 4 is the low signal frequency
which is used in Fig. 28 to reveal the behavior of equivalent parallel
conductance in the weak inversion region. The conclusion is again
drawn from Fig. 28 that interface state loss must be dominant, as from
Figs. 3 and 4 discussed previously in Section III.

The striking feature of the experimental $G_p/\omega$ vs frequency curves
at each bias in weak inversion is that they can be accurately fitted by
a single time constant using (17). An example of this is shown in Fig. 25.
Three possible explanations of this will be considered.

(i) A single level state located at an energy in the silicon band gap
corresponding to the bias at the capacitance peak in Fig. 23.
(ii) A single level near the valence band edge.
(iii) A continuum of interface states over the silicon band gap.

The first explanation can be ruled out by an examination of Figs. 24,
26, and 27. Fig. 24 shows that the capacitance peak shifts to greater
negative bias with decreasing signal frequency. The peak must always
occur at the same bias for a single level. In Figs. 26 and 27, the range
of $u_B - \bar{u}_s$ and $E_F - E_i$ corresponds to a range of bias which includes
the capacitance peak in Fig. 23. Both $\tau_m$ in Fig. 26 and $C_s$ in Fig. 27
increase monotonically through the corresponding capacitance peak in
Fig. 23. For a single level state, $C_s$ would go through a peak and $\tau_m$
first would be constant for $E_F$ on the valence band side of the single
level and then it would decrease exponentially with $u_s$ for $E_F$ on the
conduction band side.

The range of $E_F - E_i$ and $u_B - \bar{u}_s$ in Figs. 26 and 27 is limited by
the capacitance dispersion shown in Fig. 24. This means that (52)
cannot be used without significant error too far beyond the capacitance
peak (at reasonable frequencies).

To rule out the possibility of a single level state near the valence
band edge, $G_p/\omega$ vs frequency curves were obtained on a p-type sample.
This sample had steam-grown oxide on a 1 Ω-cm crystal oriented in the
[100] direction. The [100] direction has a lower built-in charge density,
so that masking effects due to lateral ac current flow into the inverted
layer beyond the field plate are minimized. A single time constant was
again found to be characteristic of the $G_p/\omega$ vs frequency curves in the
weak inversion region. The experiment was repeated on an n-type
sample oriented in the [100] direction having a steam-grown oxide, with
the same result. The fact that the same result was obtained in weak
inversion on both n- and p-type samples, and the lack of evidence for a
single level state in the same energy range in accumulation in these
samples, makes such a state as an explanation of the effect very unlikely.
The observations can be best explained in terms of a continuum of
states as done in paragraph 4.6.
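
The single time constant behaviour referred to above can be illustrated with a short sketch. It assumes that (17) has the usual single-time-constant form, loss = C_s * x / (1 + x^2) with x the product of angular frequency and time constant, which peaks at C_s/2 when x = 1. The values of `c_s` and `tau` are hypothetical, chosen only so that the peak falls inside the 50 Hz to 500 kHz band; `fit_from_peak` is an illustrative helper, not the fitting procedure used for Fig. 25.

```python
import math


def single_level_loss(omega, c_s, tau):
    """G_p/omega for a single time constant (assumed form of (17))."""
    x = omega * tau
    return c_s * x / (1.0 + x * x)


def fit_from_peak(omegas, losses):
    """Estimate (C_s, tau) from the location and height of the loss peak."""
    i = max(range(len(losses)), key=losses.__getitem__)
    return 2.0 * losses[i], 1.0 / omegas[i]


c_s = 5.0e-8    # hypothetical interface state capacitance, F/cm^2
tau = 1.0e-4    # hypothetical time constant, s

# Logarithmic frequency grid from 50 Hz to 500 kHz (201 points, four decades).
omegas = [2.0 * math.pi * 50.0 * 10.0 ** (4.0 * i / 200.0) for i in range(201)]
losses = [single_level_loss(w, c_s, tau) for w in omegas]

c_fit, tau_fit = fit_from_peak(omegas, losses)

print(f"fitted C_s = {c_fit:.3e} F/cm^2")
print(f"fitted tau = {tau_fit:.3e} s")
```

A single time constant gives a peak of height C_s/2 that is markedly narrower than the curve produced by a spread of time constants, which is why the two cases can be told apart from the shape of the measured curve alone.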

Evidence that this effect is dominated by the majority carrier density
at the silicon interface is given by Fig. 26. Fig. 27 shows that $C_s$
increases so rapidly with $E_F$ that the approximation of a constant density
of states over an interval of kT/q cannot be made. Thus, (18) cannot
be simply integrated. The fact that $C_s$ is so rapidly varying is the
reason why the curve in Fig. 26 does not have the same magnitude
of slope as the curves in Fig. 14. This rapid variation of $C_s$ also explains
the capacitance peak in Fig. 23. It can be seen that (16) will go through
a peak if $C_s$ has a strong dependence on bias.

Comparison of Fig. 27 with Fig. 15 shows that interface state density
in the lower half of the silicon bandgap in n-type has a different
magnitude and dependence on position of Fermi level than in the lower half
of the gap in p-type. The reason for this is not understood.


8.3 Limitations of the Conductance Technique


Interface state densities in the $10^{9}$ cm$^{-2}$-eV$^{-1}$ range have been
measured by this technique using the apparatus described in Section V.
For a field plate diameter of $5 \times 10^{-2}$ cm, this corresponds to about
$2 \times 10^{6}$ states/eV. For a signal of 20 mV peak-to-peak, transitions
involving only about $4 \times 10^{4}$ states can be detected.
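
The sensitivity figures above follow from the field plate area. The sketch below repeats the arithmetic, assuming a density of 1e9 states per cm^2 per eV, a circular field plate 5e-2 cm in diameter, and an energy window equal to the 20 mV peak-to-peak signal; these exponents are read from the internal consistency of the quoted numbers.

```python
import math

n_ss = 1.0e9             # interface state density, cm^-2 eV^-1
diameter = 5.0e-2        # field plate diameter, cm
signal_window = 0.020    # energy window swept by the 20 mV signal, eV

area = math.pi * (diameter / 2.0) ** 2          # field plate area, cm^2
states_per_ev = n_ss * area                     # states per eV under the plate
states_detected = states_per_ev * signal_window # states swept by the signal

print(f"field plate area : {area:.2e} cm^2")
print(f"states per eV    : {states_per_ev:.1e}")
print(f"states detected  : {states_detected:.1e}")
```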

It is important to use as thin an oxide layer as practicable to get 
large values of capacitance and equivalent parallel conductance for a 
given V,, and to minimize the error in extracting C,, . 

The range of bandgap energy over which measurements were made 
in this work was limited by the frequency range 50 Hz to 500 kHz of 
the two bridges used. 


IX. SUMMARY AND CONCLUSIONS 


(i) Majority carrier transitions to and from interface states make
the dominant contribution to the measured equivalent parallel
conductance in the accumulation, depletion, and inversion regions. This
fact makes it possible to use simplified equivalent circuits and extract
interface state properties for each type of carrier independently.

(ii) A continuum of Shockley-Read type interface state levels closely
spaced in energy across the silicon band gap seems to be characteristic
of the Si-SiO$_2$ interface.

(iii) Interface state densities in the $10^{11}$ cm$^{-2}$-eV$^{-1}$ range are usual
for “as-grown” steam oxides.

(iv) Interface state density can be increased to the $10^{12}$ cm$^{-2}$-eV$^{-1}$
range by drying the oxide.

(v) Interface state density can be reduced to the $10^{10}$ cm$^{-2}$-eV$^{-1}$ range
by H$_2$ or N$_2$ annealing at low temperatures with an aluminum film
deposited on a steam-grown oxide.

(vi) Capture cross sections for electrons and holes are independent
of energy over large portions of the silicon bandgap. They are also
independent of temperature, the processes of drying or H$_2$ or N$_2$
annealing, and, except for subtle differences, crystal orientation.

(vii) In depletion, a time constant dispersion is observed which is
much broader than expected for a continuum. The broadened time
constant dispersion is determined mainly by statistical fluctuations
of surface potential. In the samples studied, a random distribution
of built-in charges and charged interface states was found to be the
primary cause of the surface potential fluctuations. This model provides
an accurate fit to the experimental points and yields a characteristic
area whose linear dimension has about twice the magnitude and has
the same bias dependence as the silicon space-charge width. Charges
within a characteristic area cannot be distinguished as separate charges.
Random fluctuations of surface potential can also arise from a random
distribution of ionized impurities in the bulk silicon and from random
fluctuations of oxide thickness. In samples where the density of
built-in charges and charged interface states is low or the oxide layer
is thin, these can replace the random distribution of built-in charges
and charged interface states as the dominant causes of surface potential
fluctuations.

(viii) In weak inversion, the fast response time of minority carriers
and the conductance of the inversion layer result in the observation
of a single time constant at each bias.

(ix) The interface state branch of the equivalent circuit in the
depletion-accumulation region, where $\bar{u}_s < u_B$, is given in Fig. 31(a).
The equivalent circuit in the mid-gap region, where $\bar{u}_s = u_B \pm$ a few
$kT/q$, is shown in Fig. 31(b). Finally, the equivalent circuit in the weak
inversion region, where $u_B < \bar{u}_s < 2u_B$, is shown in Fig. 31(c).

(x) It is impossible to make an MIS structure in which surface
potential is absolutely uniform because, even if built-in charge and
interface state density are made small, a random distribution of ionized
bulk impurities or random fluctuations of oxide thickness will still be
present.

(xi) The accuracy required for extracting so detailed a picture of
the Si-SiO$_2$ interface from MIS capacitor measurements could only be
provided by the conductance technique (see Fig. 1).


Fig. 31 - (a) Schematic of interface state branch of equivalent circuit in
depletion-accumulation region where $\bar{u}_s < u_B$. (b) Schematic of interface
state branch of equivalent circuit in mid-gap region where $\bar{u}_s = u_B \pm$ a
few $kT/q$. (c) Schematic of interface state branch of equivalent circuit in
weak inversion region where $u_B < \bar{u}_s < 2u_B$.


APPENDIX A


In this Appendix, interface state admittance is derived from (3):

$$i_s(t) = qN_s c_n[1 - f(t)]\,n_s(t) - qe_n N_s f(t). \tag{65}$$

Expressing $f(t)$ as the sum of a dc and an ac part:

$$f(t) = f_o + \delta f, \tag{66}$$

where $f_o$ is the Fermi function established by the bias and $\delta f$ the change
caused by the ac signal. Similarly,

$$n_s(t) = n_{so} + \delta n_s, \tag{67}$$

where $n_{so}$ is the electron density at the silicon surface when the Fermi
level is at the trap level and $\delta n_s$ the change caused by the ac signal.
Substituting (66) and (67) into (65) and making the small signal
approximation by neglecting second order terms, we get

$$i_s(t) = qN_s c_n[(1 - f_o)n_{so} + (1 - f_o)\,\delta n_s - n_{so}\,\delta f] - qe_n N_s(f_o + \delta f). \tag{68}$$

From detailed balance

$$qN_s c_n(1 - f_o)n_{so} = qe_n N_s f_o. \tag{69}$$

Solving (69) for $e_n$ and substituting into (68) we get

$$i_s(t) = qN_s c_n\left[(1 - f_o)\,\delta n_s - n_{so}\frac{\delta f}{f_o}\right]. \tag{70}$$

The net current density can also be expressed as

$$i_s(t) = qN_s\frac{df}{dt}. \tag{71}$$

Equating (71) to (70)


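$$\frac{df}{dt} = c_n(1 - f_o)\,\delta n_s - c_n n_{so}\frac{\delta f}{f_o}. \tag{72}$$

The small signal variation of the Fermi function is $\delta f = f_m \exp(j\omega t)$.
Combining this with (66) results in

$$\frac{df}{dt} = j\omega\,\delta f. \tag{73}$$

Substituting (73) into (72) and solving for $\delta f$

$$\delta f = \frac{f_o(1 - f_o)}{1 + j\omega f_o/c_n n_{so}}\,\frac{\delta n_s}{n_{so}}. \tag{74}$$

Substituting (74) into (70) results in

$$i_s(t) = \frac{j\omega qN_s f_o(1 - f_o)}{1 + j\omega f_o/c_n n_{so}}\,\frac{\delta n_s}{n_{so}}. \tag{75}$$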


The ratio $\delta n_s/n_{so}$ is equal to the ac surface potential. This can be
seen as follows:

$$\frac{\delta n_s}{n_{so}} = \frac{n_i \exp(u_s - u_B)\,\delta u_s}{n_i \exp(u_s - u_B)} = \delta u_s, \tag{76}$$

where $n_i$ is the intrinsic carrier density, $u_s$ is the silicon band bending
or surface potential in units of $kT/q$, and $u_B$ the potential difference
between mid-gap and Fermi level in the quasi-neutral region of the
silicon in units of $kT/q$. Expressing $\delta u_s$ in volts gives

$$\delta u_s = \frac{q}{kT}\,\delta\psi_s. \tag{77}$$

Equation (75) becomes

$$i_s(t) = j\omega\,\frac{q^2}{kT}\,\frac{N_s f_o(1 - f_o)}{1 + j\omega f_o/c_n n_{so}}\,\delta\psi_s. \tag{78}$$

This can be written as

$$i_s(t) = Y_s\,\delta\psi_s, \tag{79}$$

where

$$Y_s = j\omega\,\frac{q^2}{kT}\,\frac{N_s f_o(1 - f_o)}{1 + j\omega f_o/c_n n_{so}}. \tag{80}$$
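
The frequency behavior of the single-level admittance can be illustrated numerically. The sketch below writes the admittance as $j\omega C_s/(1 + j\omega\tau)$ with $C_s = q^2N_sf_o(1 - f_o)/kT$ and $\tau = f_o/c_nn_{so}$, which is one way of reading the result of this Appendix. All numerical parameter values are hypothetical and chosen only for illustration; they are not taken from the measurements.

```python
import math

Q_E = 1.602e-19          # electronic charge, C
KT = 0.0259 * Q_E        # thermal energy at about 300 K, J (assumed temperature)

def single_level_admittance(omega, n_states, f_o, c_n, n_so):
    """Return (Y_s, C_s, tau) for one interface-state level.

    Y_s = j*omega*C_s / (1 + j*omega*tau), with
    C_s = q^2 N_s f_o (1 - f_o) / kT  and  tau = f_o / (c_n n_so).
    """
    c_s = Q_E**2 * n_states * f_o * (1.0 - f_o) / KT   # F/cm^2
    tau = f_o / (c_n * n_so)                            # s
    y_s = 1j * omega * c_s / (1.0 + 1j * omega * tau)   # mhos/cm^2
    return y_s, c_s, tau

# Hypothetical illustrative values, not measurements from the paper.
N_STATES = 1e10   # N_s, cm^-2
F_O = 0.5         # Fermi function at the dc bias
C_N = 1e-8        # electron capture probability, cm^3/sec
N_SO = 1e12       # electron density at the surface, cm^-3

_, C_S, TAU = single_level_admittance(1.0, N_STATES, F_O, C_N, N_SO)

def g_over_omega(omega):
    """Real part of Y_s divided by omega (conductance over frequency)."""
    y_s, _, _ = single_level_admittance(omega, N_STATES, F_O, C_N, N_SO)
    return y_s.real / omega

for factor in (0.1, 0.5, 1.0, 2.0, 10.0):
    omega = factor / TAU
    print(f"omega*tau = {factor:5.1f}   (G/omega)/C_s = {g_over_omega(omega) / C_S:.4f}")
```

In this form the conductance divided by frequency peaks at $\omega\tau = 1$ with a value of $C_s/2$, which is the single-time-constant relation invoked in Appendix C.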


APPENDIX B


B.1 Random Fluctuations of Impurity Charges in the Silicon 


We shall use the model proposed by Shockley$^{23}$ for this calculation.
Otherwise the procedure for finding the standard deviation of the
surface potential is similar to that for the case of the surface charges.

Let $M$ be the number of ionized acceptors in a characteristic volume
in the silicon space-charge region. When $\bar{M}$ is large, the probability
that there are $M$ ionized acceptors in a characteristic volume is given
by the Gaussian approximation of a Poisson distribution

$$P(M) = (2\pi\bar{M})^{-1/2}\exp[-(M - \bar{M})^2/2\bar{M}]. \tag{81}$$
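
How close the Gaussian form is to the exact Poisson distribution for a large mean can be checked with a few lines. This is a sketch only; the mean of 1000 acceptors per characteristic volume is a hypothetical value chosen for illustration.

```python
import math

def poisson_pmf(m, mean):
    """Exact Poisson probability, evaluated in log space to avoid overflow."""
    return math.exp(m * math.log(mean) - mean - math.lgamma(m + 1))

def gaussian_approx(m, mean):
    """Gaussian approximation with variance equal to the mean."""
    return math.exp(-(m - mean) ** 2 / (2 * mean)) / math.sqrt(2 * math.pi * mean)

M_BAR = 1000  # hypothetical mean number of ionized acceptors

for m in (M_BAR - 60, M_BAR, M_BAR + 60):
    print(m, poisson_pmf(m, M_BAR), gaussian_approx(m, M_BAR))

# Ratio of the two expressions at the mean; close to 1 for a large mean
ratio_at_mean = poisson_pmf(M_BAR, M_BAR) / gaussian_approx(M_BAR, M_BAR)
print(ratio_at_mean)
```

Away from the mean the two expressions differ by a few percent at this mean value, and the agreement improves as the mean grows.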


The characteristic volume from Ref. 23 is a cube in which the length
of a side is equal to the space-charge width $W$. The first transformation
of the probability density function is from number of charges to number
density

$$P(N_A) = P(M)\,dM/dN_A, \tag{82}$$

where $N_A$ is the ionized acceptor density. The relation between $M$
and $N_A$ is

$$M = N_A W^3. \tag{83}$$

Combining (81), (82), and (83), the result of the first transformation is

$$P(N_A) = (2\pi\bar{N}_A W^3)^{-1/2}\,W^3 \exp[-W^3(N_A - \bar{N}_A)^2/2\bar{N}_A], \tag{84}$$

where $\bar{N}_A$ is the mean acceptor density.

The second transformation from number density to surface potential is

$$P(u_s) = P(N_A)\,dN_A/du_s. \tag{85}$$

Rewriting (31) and (32) results in

$$Q = C_{ox}(V_o + u_s/\beta) + (2q\epsilon_{si}N_A/\beta)^{1/2}[\exp(-u_s) + u_s - 1]^{1/2}. \tag{86}$$
Solving (86) for $N_A$,

$$N_A = \frac{\beta}{2q\epsilon_{si}}\,\frac{[Q - C_{ox}(V_o + u_s/\beta)]^2}{\exp(-u_s) + u_s - 1}. \tag{87}$$

Equation (87) is not a single-valued relation between $N_A$ and $u_s$. We
therefore restrict the problem, as with the surface charges, to the case
where the fluctuations $N_A - \bar{N}_A$ are very small. For very small
fluctuations, (87) can be differentiated, assuming $Q$ and oxide thickness
to be uniform, to get

$$dN_A = -\left\{\left(\frac{2N_A}{q\epsilon_{si}\beta}\right)^{1/2}\frac{C_{ox}}{[\exp(-u_s) + u_s - 1]^{1/2}} + \frac{N_A[1 - \exp(-u_s)]}{\exp(-u_s) + u_s - 1}\right\}du_s. \tag{88}$$

Equation (88) is now expressed in terms of $W$ using (32)

$$\frac{\epsilon_{si}}{W} = \left(\frac{q\epsilon_{si}N_A\beta}{2}\right)^{1/2}\frac{1 - \exp(-u_s)}{[\exp(-u_s) + u_s - 1]^{1/2}}. \tag{89}$$

Solving (89) for $\exp(-u_s) + u_s - 1$, we have

$$\exp(-u_s) + u_s - 1 = \frac{qN_A\beta}{2\epsilon_{si}}\,W^2[1 - \exp(-u_s)]^2. \tag{90}$$

Substituting (90) into (88) we get

$$dN_A = -\frac{2(WC_{ox} + \epsilon_{si})}{q\beta W^2[1 - \exp(-u_s)]}\,du_s. \tag{91}$$

Because $\bar{N}_A$ is given by (87) when $u_s = \bar{u}_s$, $dN_A$ can be evaluated
about $\bar{N}_A$ at each bias from

$$dN_A = -\frac{2[W(\bar{u}_s)C_{ox} + \epsilon_{si}]}{q\beta W^2(\bar{u}_s)[1 - \exp(-\bar{u}_s)]}\,du_s. \tag{92}$$

Equation (92) is the transformation equation sought. Replacing $dN_A$
and $du_s$ in (92) by the small fluctuations $N_A - \bar{N}_A$ and $u_s - \bar{u}_s$
respectively, we get

$$N_A - \bar{N}_A = -\frac{2[W(\bar{u}_s)C_{ox} + \epsilon_{si}]}{q\beta W^2(\bar{u}_s)[1 - \exp(-\bar{u}_s)]}\,(u_s - \bar{u}_s). \tag{93}$$

Combining (84), (85), (92), and (93), we get

$$P(u_s) = (2\pi\sigma_B^2)^{-1/2}\exp[-(u_s - \bar{u}_s)^2/2\sigma_B^2], \tag{94}$$

where the standard deviation $\sigma_B$ is

$$\sigma_B = \frac{q\beta[\bar{N}_A W(\bar{u}_s)]^{1/2}[1 - \exp(-\bar{u}_s)]}{2[W(\bar{u}_s)C_{ox} + \epsilon_{si}]}. \tag{95}$$

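
The chain from (87) to (95) can be checked numerically. The sketch below differentiates (87) by a central difference and compares the result with the closed form (91), taking $W$ from (90); it then evaluates the standard deviation of (95). The doping, band bending, oxide capacitance, and bias values are hypothetical, and the sign convention used to pick a surface charge consistent with (87) is an assumption made here.

```python
import math

Q_E = 1.602e-19              # electronic charge, C
EPS_SI = 11.7 * 8.854e-14    # permittivity of silicon, F/cm
BETA = 38.5                  # q/kT in 1/V, as quoted in the List of Symbols

def f_dep(u_s):
    """The recurring factor exp(-u_s) + u_s - 1."""
    return math.exp(-u_s) + u_s - 1.0

def n_a_from_87(u_s, q_surf, c_ox, v_o):
    """Acceptor density as a function of surface potential, as in (87)."""
    bracket = q_surf - c_ox * (v_o + u_s / BETA)
    return BETA / (2.0 * Q_E * EPS_SI) * bracket**2 / f_dep(u_s)

def width_from_90(u_s, n_a):
    """Space-charge width W obtained by solving (90) for W."""
    return math.sqrt(2.0 * EPS_SI * f_dep(u_s) / (Q_E * n_a * BETA)) / (1.0 - math.exp(-u_s))

def dna_du_from_91(u_s, n_a, c_ox):
    """Closed-form derivative dN_A/du_s, as in (91)."""
    w = width_from_90(u_s, n_a)
    return -2.0 * (w * c_ox + EPS_SI) / (Q_E * BETA * w**2 * (1.0 - math.exp(-u_s)))

def sigma_b_from_95(u_s, n_a, c_ox):
    """Standard deviation of surface potential (units of kT/q), as in (95)."""
    w = width_from_90(u_s, n_a)
    return Q_E * BETA * math.sqrt(n_a * w) * (1.0 - math.exp(-u_s)) / (2.0 * (w * c_ox + EPS_SI))

# Hypothetical operating point, for illustration only
N_A0 = 1e16      # mean acceptor density, cm^-3
U_S0 = 10.0      # mean surface potential, units of kT/q
C_OX = 3.45e-8   # oxide capacitance, F/cm^2 (about 1000 angstroms of SiO2)
V_O = 1.0        # bias term, V

# Assumed sign convention: choose Q so that (87) returns N_A0 at U_S0
Q_SURF = C_OX * (V_O + U_S0 / BETA) + math.sqrt(
    2.0 * Q_E * EPS_SI * N_A0 * f_dep(U_S0) / BETA)

H = 1e-4  # step for the central difference
numeric = (n_a_from_87(U_S0 + H, Q_SURF, C_OX, V_O)
           - n_a_from_87(U_S0 - H, Q_SURF, C_OX, V_O)) / (2.0 * H)
analytic = dna_du_from_91(U_S0, N_A0, C_OX)
sigma = sigma_b_from_95(U_S0, N_A0, C_OX)
W0 = width_from_90(U_S0, N_A0)

print(f"dN_A/du_s numeric  = {numeric:.4e}")
print(f"dN_A/du_s analytic = {analytic:.4e}")
print(f"sigma_B            = {sigma:.3f} (units of kT/q)")
```

The same standard deviation follows from dividing the spread of $N_A$ in (84), $(\bar{N}_A/W^3)^{1/2}$, by the magnitude of the derivative, which is the consistency check used in the assertions.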
B.2 Random Fluctuations of Oxide Thickness

Solving (86) for $x$, the oxide thickness,

$$x = \frac{\epsilon_{ox}}{C_{ox}} = \frac{\epsilon_{ox}(V_o + u_s/\beta)}{Q - \{(2q\epsilon_{si}N_A/\beta)[\exp(-u_s) + u_s - 1]\}^{1/2}}, \tag{96}$$

where $\epsilon_{ox}$ is the dielectric permittivity of the oxide layer. If $Q$ is not
zero, it will influence the fluctuations of surface potential caused by
fluctuations in oxide thickness even if it is uniform. To get the influence
of oxide thickness fluctuations independent of surface charge, we let
$Q = 0$. Thus, (96) becomes

$$x = -\frac{\epsilon_{ox}(V_o + u_s/\beta)}{\{(2q\epsilon_{si}N_A/\beta)[\exp(-u_s) + u_s - 1]\}^{1/2}}. \tag{97}$$

Differentiating (97), assuming $N_A$ to be uniform, results in

$$dx = -\frac{\epsilon_{ox}}{(2q\epsilon_{si}N_A/\beta)^{1/2}}\left\{\frac{1}{\beta[\exp(-u_s) + u_s - 1]^{1/2}} - \frac{(V_o + u_s/\beta)[1 - \exp(-u_s)]}{2[\exp(-u_s) + u_s - 1]^{3/2}}\right\}du_s. \tag{98}$$

Eliminating $\exp(-u_s) + u_s - 1$ from (98) by using (90) we get

$$dx = -\frac{W\epsilon_{ox} + x\epsilon_{si}}{q\beta N_A W^2[1 - \exp(-u_s)]}\,du_s. \tag{99}$$

Equation (97) gives $\bar{x}$ when $u_s = \bar{u}_s$ so that $dx$ evaluated about $\bar{x}$ is

$$d\bar{x} = -\frac{W(\bar{u}_s)\epsilon_{ox} + \bar{x}\epsilon_{si}}{q\beta N_A W^2(\bar{u}_s)[1 - \exp(-\bar{u}_s)]}\,d\bar{u}_s. \tag{100}$$

$d\bar{x}$ is the standard deviation of $x$ and $d\bar{u}_s$ the standard deviation of
surface potential; thus,

$$\sigma_x = \frac{qN_A\beta W^2(\bar{u}_s)[1 - \exp(-\bar{u}_s)]}{W(\bar{u}_s)\epsilon_{ox} + \bar{x}\epsilon_{si}}\,d\bar{x}. \tag{101}$$


APPENDIX C 


There is a simple way to relate $C_s(\omega, \bar{u}_s)$ to measured $(G_p/\omega)_{max}$.
The distributed interface state network in Fig. 8(a) can be represented
at one frequency and bias by an equivalent series $R_sC_s$ network in
which $R_s$ and $C_s$ are frequency and bias dependent. The equivalent
circuit will then look like Fig. 6. If we let $R_s$ and $C_s$ be independent
of frequency, $G_p/\omega$ vs frequency from (17) can be made to coincide
with the measured $G_p/\omega$ vs frequency curve at each bias at $(G_p/\omega)_{max}$
as shown in Fig. 32. Therefore, from (17), $C_s(\omega, \bar{u}_s) = 2(G_p/\omega)_{max}$ at
each bias and we obtain (54) by substituting this into (53) and solving
for $C_D$.

[Figure: $G_p/\omega$ versus log frequency.]

Fig. 32 - Schematic of $G_p/\omega$ vs log frequency showing how $C_s$ is obtained
from measured $G_p/\omega$ peak.


APPENDIX D 


We shall calculate $G_p/\omega$ vs $\omega$ which would result from transitions
between the conduction band near the surface and states distributed
with uniform density into the oxide. The time constant distribution
for tunnelling from (61) is

$$\tau = \tau_o \exp(2K_o\xi), \tag{102}$$

where $\tau_o = (n_{so}\bar{v}\sigma_n)^{-1}$.

The contribution to $G_p/\omega$ from states located between $\xi$ and $\xi + d\xi$
in the oxide is

$$d(G_p/\omega) = (q/2\omega)(N_{ss}/x_o)\,\tau_o^{-1}\exp(-2K_o\xi)\,\ln[1 + \omega^2\tau_o^2\exp(4K_o\xi)]\,d\xi, \tag{103}$$

where (102) has been substituted for $\tau$ in (20) for the continuum of
states, $N_{ss}/x_o$ is the uniform density of states in the oxide in
cm$^{-3}$-eV$^{-1}$, and $x_o$ is the oxide thickness in cm.

Total $G_p/\omega$ from all the states in the oxide, therefore, is

$$G_p/\omega = (q/2\omega)(N_{ss}/x_o)\,\tau_o^{-1}\int_0^{x_o}\exp(-2K_o\xi)\,\ln[1 + \omega^2\tau_o^2\exp(4K_o\xi)]\,d\xi. \tag{104}$$

To get $G_p/\omega$ vs $\omega$, (104) is numerically integrated on an IBM 7094
computer using the trapezoidal rule. The result is shown in Fig. 33.
Because we are interested only in the shape of the curve, the frequency
scale is arbitrary.
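
A present-day equivalent of this numerical integration can be sketched in a few lines. The version below is not the original program: it works in the dimensionless depth $s = 2K_o\xi$ and the dimensionless frequency $\omega\tau_o$, drops all constant prefactors (only the shape matters), and uses a hypothetical upper limit `s_max` standing in for $2K_ox_o$.

```python
import math

def log1pexp(z):
    """Numerically stable ln(1 + exp(z))."""
    return z + math.log1p(math.exp(-z)) if z > 0 else math.log1p(math.exp(z))

def tunnelling_shape(omega_tau_o, s_max=40.0, n_steps=4000):
    """Shape of G_p/omega for uniform states in the oxide.

    Evaluates (1/Omega) * integral from 0 to s_max of
    exp(-s) * ln(1 + Omega^2 * exp(2 s)) ds with the trapezoidal rule,
    where Omega = omega * tau_o and s = 2 * K_o * xi.
    s_max is a hypothetical stand-in for 2 * K_o * x_o.
    """
    ln_omega = math.log(omega_tau_o)
    step = s_max / n_steps

    def integrand(s):
        return math.exp(-s) * log1pexp(2.0 * (ln_omega + s))

    total = 0.5 * (integrand(0.0) + integrand(s_max))
    for i in range(1, n_steps):
        total += integrand(i * step)
    return total * step / omega_tau_o

for exponent in range(-4, 4):
    x = 10.0 ** exponent
    print(f"omega*tau_o = {x:8.0e}   shape = {tunnelling_shape(x):.4f}")
```

With these assumptions the curve is flat at low frequency and falls off once $\omega\tau_o$ approaches unity, so it has no peak and is asymmetric on a log-frequency axis.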

$G_p/\omega$ is normalized to eliminate $N_{ss}/x_o$. A plot of normalized $G_p/\omega$
obtained from experiment is also shown in Fig. 33 for comparison.

The shape of the curve calculated from the tunnelling model is
determined essentially by (102). The shortest time constant in this
distribution occurs at $\xi = 0$, which is right at the interface. $G_p/\omega$
drops off rapidly as the period of the applied signal becomes comparable
to and shorter than this time constant. When the period of the applied
signal is long compared to the shortest time constant in (102), $G_p/\omega$
becomes a constant, as we have assumed a constant density of states
into the oxide. The main purpose of assuming a uniform density of
states into the oxide is to illustrate the asymmetry of the $G_p/\omega$ vs
log $\omega$ curve introduced by the tunnelling time constant distribution
(102). The $G_p/\omega$ vs log $\omega$ curve will go through a maximum if the
density of states decreases with distance into the oxide. As long as the
density of states decreases more slowly with distance into the oxide than
the time constants given by (102) increase, $G_p/\omega$ will decrease faster
on the high-frequency side of the peak than on the low-frequency side.





[Figure: normalized $G_p/\omega$ versus frequency (arbitrary scale).]

Fig. 33 - Normalized $G_p/\omega$ vs log frequency for tunnelling calculated from
(104) and normalized $G_p/\omega$ vs log frequency obtained from measurement.
Frequency scale is arbitrary to compare the shapes of the two curves.

The $G_p/\omega$ vs log $\omega$ curve obtained from measurement is symmetric
about its peak. A distribution of states into the oxide which would
give a $G_p/\omega$ vs log $\omega$ curve symmetric about its peak using the
tunnelling model would be a peaked density distribution having its
maximum some distance into the oxide. In addition, the width of this
distribution would have to be considerably smaller than $1/2K_o$ to get
$G_p/\omega$ vs log $\omega$ curves in the frequency range between 50 Hz and 500 kHz.
Such a distribution would be rather arbitrary. Thus, it can be seen
from the symmetry of the observed $G_p/\omega$ vs log $\omega$ curves that tunnelling
to states distributed into the oxide is not likely to be the dominant
cause of time constant dispersion in the samples measured.


LIST OF SYMBOLS 


$a$  Maximum amplitude of the ac surface potential in volts.

$b$  Maximum amplitude of applied ac signal in volts.

C  Equivalent parallel capacitance of interface state branch of equivalent circuit having a single time constant in farads/cm$^2$.

CB  Representation of conduction band.

$C_D$  Depletion layer capacitance in farads/cm$^2$.

$C_I$  Inversion layer capacitance in farads/cm$^2$.

$C_m$  Measured equivalent parallel capacitance of the MIS capacitor in farads/cm$^2$.

$c_n$  Electron capture probability in cm$^3$/sec.

Can  Low-frequency capacitance of MIS capacitor in farads/cm$^2$.

$C_{ox}$  Oxide layer capacitance in farads/cm$^2$.

$c_p$  Hole capture probability in cm$^3$/sec.

$C_p$  Equivalent parallel capacitance of depletion layer and interface state branch of equivalent circuit in farads/cm$^2$.

$C_s$  Interface state capacitance associated with a single time constant in farads/cm$^2$.

dv  Number of characteristic areas in which the number of surface charges is between $N$ and $N + dN$ or the surface potential is between $u_s$ and $u_s + du_s$.

$E_F$  Fermi level in units of $kT/q$.

$E_i$  Mid-gap energy in units of $kT/q$.

$e_n$, $e_p$  Emission constants for electrons and holes in sec$^{-1}$.

$f_m$  Maximum value of the perturbation of the Fermi function by the ac signal.

$f_o$  Fermi function at the dc bias.

$f(t)$  Fermi function as a function of time.

Measured equivalent parallel conductance of MIS capacitor in mhos/cm$^2$.

Emission rate of electrons from a single level interface state in cm$^{-2}$-sec$^{-1}$.

$G_p$  Equivalent parallel conductance of the interface state branch of the equivalent circuit in mhos/cm$^2$.

Equivalent parallel conductance corrected for $C_{ox}$ in mhos.

Emission rate of holes from a single level interface state in cm$^{-2}$-sec$^{-1}$.

Majority carrier capture conductance of a single level interface state in mhos/cm$^2$.

$i_s(t)$  Current density charging the interface states in amp/cm$^2$.

Current density charging the space-charge layer in amp/cm$^2$.

Current density through MIS capacitor in amp/cm$^2$.

$k$  Boltzmann's constant in eV-coul/°K.

$M$  Number of ionized acceptors in a characteristic volume.

$\bar{M}$  The mean of $M$.

$N$  Number of built-in charges and charged interface states in a characteristic area.

$\bar{N}$  The mean of $N$.

$N_A$  Acceptor impurity density in cm$^{-3}$.

$\bar{N}_A$  Mean ionized acceptor density in cm$^{-3}$.

$N_D$  Donor impurity density in cm$^{-3}$.

$n_i$  Intrinsic carrier concentration in cm$^{-3}$.

$N_s$  Density of a single level interface state in cm$^{-2}$.

$n_{so}$  Electron density established by the dc bias in cm$^{-3}$.

$N_{ss}$  Density of interface states, $dN_s/d\psi$, in cm$^{-2}$-eV$^{-1}$.

$n_s(t)$  Time dependent electron density at the silicon surface in cm$^{-3}$.

$P(M)$  Probability that there are $M$ ionized acceptors in a characteristic volume.

$P(N)$  Probability of finding $N$ charges in a characteristic volume.

$P(Q)$  Probability that the surface-charge density is $Q$ in cm$^2$/coul.

$p_{so}$  Hole density established by the dc bias in cm$^{-3}$.

$p_s(t)$  Time dependent hole density at the silicon surface in cm$^{-3}$.

$P(u_s)$  Probability that surface potential in a characteristic area is $u_s$.

$q$  Electronic charge in coulomb.

$Q$  Sum of built-in and interface state charge densities in coul/cm$^2$.

$\bar{Q}$  Mean surface-charge density in coul/cm$^2$.

Fixed charge density in the oxide in coul/cm$^2$.

Total interface state charge density in coul/cm$^2$.

Silicon space-charge density in coul/cm$^2$.

Total charge density in coul/cm$^2$.

Electron and hole capture resistance of a single level in ohms-cm$^2$.

$R_n(t)$, $R_p(t)$  Capture rate of electrons and holes by a single level interface state in cm$^{-2}$-sec$^{-1}$.

$R_s$  Majority carrier capture resistance of a single level interface state in ohm-cm$^2$.

$t$  Time in seconds.

$T$  Absolute temperature in °K.

$u$  Potential difference between mid-gap and an interface state level in units of $kT/q$.

$u_B$  $= \ln(n_i/N_D)$ for n-type silicon; $= \ln(N_A/n_i)$ for p-type silicon.

Potential difference between mid-gap and Fermi level at silicon surface in units of $kT/q$.

In w/¢,N; «

$u_s$  Surface potential in units of $kT/q$.

$\bar{u}_s$  Mean surface potential in units of $kT/q$.

VB  Representation of valence band.

$\bar{v}$  Average thermal velocity of holes and electrons in cm/sec.

Time dependent voltage applied to MIS capacitor in volts.

dc bias across MIS capacitor in volts.

$V_o$  Arbitrary dc bias in volts.

$W$  Depletion layer thickness in cm.

$\bar{x}$  Mean oxide thickness measured from the interface in cm.

In wr, = Ug tu, — U,.

tu, — Us + In 2.5.

$Y_s$  Admittance of a single level interface state in mhos/cm$^2$.

Admittance of an interface state continuum in mhos/cm$^2$.

MIS capacitor impedance less reactance of the oxide layer in ohms-cm$^2$.

Characteristic area, cm$^2$.

$\beta$  $q/kT = 38.5$ volt$^{-1}$.

Additive constant in volts.

$\delta f$  Time dependent part of Fermi function.

$\delta n_s$  Incremental electron surface-charge density induced by $\delta\psi_s$ in cm$^{-3}$.

V4  ac signal applied to MIS capacitor in volts.

$\delta\psi_s$  Silicon band bending by ac perturbation in volts.

$\epsilon_{si}$, $\epsilon_{ox}$  Dielectric permittivity of silicon and SiO$_2$ layer respectively in farad/cm.

$\xi$  Distance into oxide measured from interface in cm.

$\sigma_B$  Standard deviation of surface potential caused by fluctuations of ionized acceptors.

$\sigma_n$, $\sigma_p$  Electron and hole capture cross sections in cm$^2$.

$\sigma_s$  Standard deviation of surface potential caused by fluctuations of surface charges.

$\sigma_x$  Standard deviation of surface potential caused by oxide thickness fluctuations.

$\tau$  Majority carrier time constant of a single level interface state, $R_sC_s$, in seconds.

$\tau_m$  Measured interface state time constant in seconds.

$\tau_{mo}$  Time constant at $\bar{u}_s = u_B$.

$\tau_n$, $\tau_p$  Electron and hole time constant for a single level in seconds.

$\psi$  Energy in electron volts.

$\psi_B$  $(kT/q)\,u_B$ in volts.

$\psi_s$  Band bending or surface potential in volts.

$\bar{\psi}_s$  Mean surface potential in volts.

Wee  Surface potential plus an additive constant in volts.

$\psi_{so}$  Surface potential at the dc bias in volts.

$\omega$  Angular frequency $= 2\pi f$ in sec$^{-1}$.


The Nonlinearity of the Reverse Current-
Voltage Characteristics of a p-n Junction 
Near Avalanche Breakdown 


By S. M. SZE and R. M. RYDER


(Manuscript received January 23, 1967) 


For nonlinear applications such as high-speed switching, a device figure
of merit is $\gamma$, the ratio of the second derivative to the first derivative of
the current-voltage (I-V) characteristic, or $\gamma = (d^2I/dV^2)/(dI/dV)$. At
room temperature, the value of $\gamma$ for an ideal forward-bias Schottky diode
is about 40 V$^{-1}$. It is shown that although the ideal reverse breakdown
characteristic could give a value of $\gamma$ greater than 40 V$^{-1}$, because of the
statistical distribution of impurities, the effect of space-charge
resistance, and other complications, much lower values of $\gamma$ are expected.
Furthermore, the nonlinear characteristic is noisy, relatively slow, and
causes some power consumption. It appears, therefore, that this
nonlinearity is not likely to supersede Schottky barrier diodes in
high-speed switching applications. It does not, however, rule out the
possibility of microwave generation application.


For nonlinear applications such as high-speed switching, a device
figure of merit is $\gamma$, the ratio of the second derivative to the first
derivative of the current-voltage characteristic. The value of $\gamma$ is a
measure of the degree of nonlinearity, normalized to the operating
admittance level. It is used here to compare the nonlinearity of a
reverse-biased diode near breakdown to that of a forward-biased p-n
junction or a Schottky barrier.

The current-voltage characteristic and $\gamma$ for a forward-biased p-n
junction or a Schottky barrier are given by

$$I = I_s[e^{qV/nkT} - 1] \tag{1}$$

$$\gamma = (d^2I/dV^2)/(dI/dV) = \frac{q}{nkT}. \tag{2}$$

At room temperature, the value of $\gamma$ for an ideal Schottky barrier
($n = 1$) is about 40 V$^{-1}$ independent of bias.
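
The room-temperature figure can be reproduced directly. The sketch below evaluates $q/nkT$ at 300 K and, as a cross-check, estimates the same ratio of derivatives by finite differences on an ideal exponential characteristic. The saturation current and the bias point are hypothetical values used only for illustration.

```python
import math

K_B = 1.380649e-23      # Boltzmann constant, J/K
Q_E = 1.602176634e-19   # electronic charge, C

def gamma_ideal(temperature_k=300.0, n=1.0):
    """gamma = q/(n k T) for an ideal exponential I-V characteristic, in 1/V."""
    return Q_E / (n * K_B * temperature_k)

def gamma_numeric(v, i_s=1e-12, temperature_k=300.0, n=1.0, dv=1e-4):
    """Ratio of second to first derivative of I(V) by central differences.

    i_s is a hypothetical saturation current in amperes.
    """
    a = gamma_ideal(temperature_k, n)

    def current(x):
        return i_s * math.expm1(a * x)   # I = I_s [exp(qV/nkT) - 1]

    d1 = (current(v + dv) - current(v - dv)) / (2.0 * dv)
    d2 = (current(v + dv) - 2.0 * current(v) + current(v - dv)) / dv**2
    return d2 / d1

print(f"gamma (ideal, 300 K, n = 1)  = {gamma_ideal():.2f} 1/V")
print(f"gamma (numeric, V = 0.3 V)   = {gamma_numeric(0.3):.2f} 1/V")
```

Both values come out near 38.7 V$^{-1}$, which the text rounds to about 40 V$^{-1}$, and the numerical estimate does not depend on the bias point chosen.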

The present note is undertaken to answer the following question:
Is it possible at room temperature to have a value of $\gamma$ greater than
40 V$^{-1}$ for a p-n junction near its avalanche breakdown voltage? The
answer is “yes” for an ideal isothermal breakdown characteristic and
“no” for practical considerations.

Near avalanche breakdown the isothermal current-voltage characteristic
without space-charge effect is given by

$$I = I_oM = \frac{I_o}{1 - \int_0^W \alpha \exp\left[-\int_0^x (\alpha - \beta)\,dx'\right]dx}, \tag{3}$$

where $M$ is the multiplication factor, $W$ the depletion width depending
on the applied voltage, and $\alpha$ and $\beta$ are the ionization rates of
electrons and holes, respectively.

The value of $\gamma$ computed from (3) is plotted in Fig. 1 [curve (a)] as
a function of $M$ for a silicon p$^+$n junction with n-type background doping
of $5 \times 10^{16}$ cm$^{-3}$ and a breakdown voltage of 19.48 volts. It is clear
that the value of $\gamma$ exceeds 40 V$^{-1}$ at $M \approx 65$, corresponding to a
voltage approximately 50 mV ($= \Delta V$) smaller than the 19.48-volt
breakdown voltage. For a background doping of $1.5 \times 10^{17}$ cm$^{-3}$, the
breakdown voltage is 9.546 volts. The value of $\gamma$ exceeds 40 V$^{-1}$ at
$M = 50$ [curve (b)], corresponding to a voltage some 50 mV smaller
than the breakdown voltage. Similar results are obtained for other
dopings and different semiconductors.

Now, let us consider the space-charge effect. The space charge of
holes and electrons produced by the avalanche generates a counter
emf which reduces the field across the multiplying region. Since the
voltage reduction is proportional to the current, it is represented as a
resistance; furthermore, it increases as the square of the length of the
region in which the space charges accumulate. For the above devices
[curve (a)] with an n region approximately equal to the depletion
width at breakdown, and a circular area of one mil in diameter, the
equivalent space-charge resistance $R_{sc}$ is about 50 ohms. The
incorporation of the space-charge resistance will increase the applied
voltage for any current level, thus reducing the value of $\gamma$ at any
given $M$. The computed result is also shown in Fig. 1 [curve (c)]. One
notices that $\gamma$ reaches a maximum value of about 500 V$^{-1}$ at an M
= 10°. For the above device area with a saturation current density of
about 10° amp/cm$^2$, the current at M = 10° is still very small, of the
order of nanoamperes. For larger $M$, $\gamma$ decreases rapidly to zero within
10 mV. If one increases the width of the n region, $R_{sc}$ also increases;
eventually for large enough $R_{sc}$, such as shown in Fig. 1 curve (d), the
value of $\gamma$ will be smaller than 40 V$^{-1}$ for any $M$.

[Figure: $\gamma$ versus $M$, multiplication factor.]

Fig. 1 - $\gamma$ versus multiplication factor. (a) p$^+$n junction in silicon with
background doping $5 \times 10^{16}$ cm$^{-3}$ and without space-charge effect. (b) p$^+$n
junction in silicon with background doping $1.5 \times 10^{17}$ cm$^{-3}$ and without
space-charge effect. (c) Same as (a) with 50-ohm space-charge resistance.
(d) Same as (a) with 5000-ohm space-charge resistance.

The effects of statistical spatial fluctuations of donor and acceptor
ions on the breakdown voltages have been considered by Shockley.
It is found that this randomness leads to a characteristic fluctuation
voltage of about 300 mV in silicon. The effect of these fluctuations in
a p-n junction is to produce local regions with breakdown about a few
hundred millivolts lower than the average in uncompensated material,
and larger by $[(N_d + N_a)/(N_d - N_a)]^{1/2}$ in compensated material.

It is clear that this voltage fluctuation is an order of magnitude
larger than the value of $\Delta V$ which is the difference between the
breakdown voltage ($M \to \infty$) and the voltage at which $M \approx 50$.
Consequently, in a practical junction with this effect present, the
curvature will be very much less than the ideal calculation of Fig. 1.

We conclude that the normalized curvature $\gamma$ could be large for a
junction near breakdown, but only if the junctions were ideal. It
would be possible for $\gamma$ to exceed 40 V$^{-1}$ provided space charge,
heating, microplasma, and statistical effects are small. This ideal high
curvature would be limited to regions within a few millivolts of
breakdown, where the multiplication ratio $M$ is high, 50 or more.

Practical considerations severely limit the use of this nonlinearity: 


(i) Because of statistical fluctuations in impurity distributions, there
is a characteristic spatial fluctuation in breakdown voltage of a few
hundred millivolts in silicon, as pointed out by Shockley. This alone
drastically reduces $\gamma$ below the ideal value.

(ii) Series resistance, either ohmic or space charge, limits the
region of high curvature to small currents, typically a few
microamperes for a 1-mil diameter junction. Therefore, with usual shunt
capacitances, the response time tends to be slower than for a good
Schottky barrier diode.

(iii) Microplasmas, if present, would cause erratic curvature and
high noise.

(iv) The noise is high compared to a forward-biased or zero-biased
Schottky barrier. So is the power dissipation.

(v) Negative resistance complications are possible.


In summary, although the ideal reverse breakdown characteristic
could give a value of $\gamma$ greater than 40 V$^{-1}$, because of the statistical
distribution of impurities, the effect of space-charge resistance, and
other complications above, much lower values of $\gamma$ are expected.
Furthermore, the nonlinear characteristic is noisy, relatively slow, and
involves some power consumption. It appears, therefore, that this
nonlinearity is not likely to supersede Schottky barrier diodes in
high-speed switching applications. It does not, however, rule out the
possibility of microwave generation application. This latter application
is not discussed here; it would utilize not only the large nonlinear
resistance of the I-V characteristic near breakdown but also the
nonlinear reactance and negative conductance due to the interaction
between transit time and the avalanche process in the device.


Subjective Evaluation of Transmission
Delay in Telephone Conversations 


By E. T. KLEMMER 
(Manuscript received February 1, 1967) 


An earlier experiment by Riesz and Klemmer on the effect of pure- 
transmission delay upon natural telephone conversations was extended in 
a test with more than double the time period and number of calls. The 
previous finding of little or no adverse reaction to round-trip pure delays 
of 600 and 1200 msec alone was confirmed. The previous finding of a large 
increase in dissatisfaction with both of these delays following exposure to 
2400 msec was not obtained. Exposure to delays of 2400 msec led to no 
dissatisfaction with later calls at 600 msec, but some rejections at 1200 msec 
did occur. There is no contradiction of other results on normal telephone 
circuits with 2-wire terminations (and related echo sources, paths, and 
suppressors) wherein customer dissatisfaction is greater with 600 msec 
delay than with the much shorter delay of a normal long-distance circuit.


A previous paper by Riesz and Klemmer!? in this journal described 
laboratory experiments on the effect of transmission delay upon the 
quality of telephone circuits for normal conversation. These
experiments were of two types: (i) “pure delay” tests in which long
transmission times were employed, but the side effects of echo and echo
suppressors were avoided by using special 4-wire telephone circuits
and (ii) “2-wire” tests which used long transmission times in normal
2-wire circuits (or circuits with 2-wire terminations) with echo sources
and echo suppressors.

Since the publication of these experiments, several evaluations out- 
side the laboratory have been done on circuits with long transmission 
times and naturally-occurring telephone calls (e.g., Helder,? Klem- 
mer*). These studies have borne out the earlier laboratory finding of 
considerable dissatisfaction with 2-wire circuits for round-trip delays 
of 600 msec or more. The pure delay condition could not be evaluated 
in the field tests since it requires complete separation of the transmit- 
ting and receiving paths which is not available in the normal telephone 
plant.* Even though the pure delay condition is not obtainable in
the telephone network used by the public for domestic and over- 
seas calls, it is of interest because it represents a limiting point 
for the effect of transmission delay on the quality of the circuits 
for conversation. It is not likely that any changes in echo suppressors 
or other circuit devices would produce better transmission quality than 
a 4-wire system with the same transmission time. Thus, if the degrada- 
tion noted in the natural 2-wire circuits with long delay were present 
in similar amount on 4-wire circuits of the same delay, there would 
be little hope of improving transmission quality by improving echo 
control methods. If, on the other hand, the effects of pure delay alone 
are much less than that due to delays, echo, and echo suppression, 
then improvement in echo control methods is definitely indicated. 

The results of the previous laboratory study with pure delay were 
very limited in number of people and calls, and also confounded by 
the introduction of extremely long delays (2400 msec) in the middle 
of the experiment. The data had shown little or no dissatisfaction 
with pure delays of 600 and 1200 msec prior to the introduction of 
the 2400-msec delay, but showed an increasing rejection of circuits 
with the lesser delays after exposure to the 2400-msec condition. 

The chief purpose of the present experiment was to see if an in- 
creasing rejection rate would occur with continued exposure to pure 
delays of 600 and 1200 msec only. Therefore, the present experiment 
repeats the 12 weeks of the previous pure delay study but without the 
introduction of the 2400-msec delay. 

After the 12 weeks of alternate days of 600- and 1200-msec delay, 
periods of 1800- and 2400-msec delay were inserted to re-evaluate the
effect that these longer delays would have upon the users' reaction to
the following days of 600- and 1200-msec delay.


I. SIMULATION APPARATUS: SIBYL 


The simulator called Sibyl, which permits the insertion of experi- 
mental circuits into naturally-occurring telephone calls, was the same as 
that employed in the previous study? and is described by Irwin.’ Elimi- 
nation of echo and echo suppressors is attained by converting all tele- 
phone instruments to full 4-wire operation, separating the transmit and 
receive paths. Normal sidetone was provided within each telephone set. 

*Other laboratory tests on the effect of pure delay upon conversational tasks
have been reported by Bricker, Krauss and Bricker,} and Vartabedian.6 These
did not involve naturally-occurring telephone calls, and thus are not directly
comparable to the tests described here.


II. SUBJECTS


Twenty administrative employees of Bell Telephone Laboratories 
were selected on the basis of questionnaires sent to several hundred 
people asking about frequencies of telephone calls to other extensions 
in Bell Laboratories. These people were selected to form a group with 
a high reported-rate of calling each other since the delay circuits could 
only be used when they called each other. 


III. INSTRUCTIONS


The participants were told that some of their calls would go over 
special experimental circuits. They were not told which calls would be 
affected or anything about the nature of the experimental circuits. 
They were told that if they found any circuit “unsatisfactory for 
normal telephoning” they should dial the digit “4” without hanging up 
or breaking the connection, and the standard circuit would be re- 
stored. The instructions called for the originating party to reject the 
circuit, but actually either party could reject the experimental circuit 
and the few instances of rejection by the called party were also 
counted as rejected calls. 


IV. EXPERIMENTAL DESIGN 


The schedule of delays on the experimental circuits was as follows: 


Weeks 1 through 12     600 and 1200 msec on alternate days
Weeks 13 through 14    1800 msec every day
Weeks 15 through 16    2400 msec every day
Weeks 17 through 26    600 and 1200 msec on alternate days


The delay for the day was inserted on each call made by one subject 
to another subject unless the simulator was already in use. No calls 
involving other stations could be put over the experimental circuits 
(because of the 4-wire requirement), and therefore, only a small per- 
centage of any subject’s calls went over the experimental circuits. The 
subjects were not told of this limitation and none reported knowledge 
of it in the post-test interviews. 


V. RESULTS 


Fig. 1 — Rate of rejection of circuits as a function of weeks of the experiment
from Riesz and Klemmer® and for the present experiment. Combined data from
18 and 20 subjects, respectively.

The percentage of calls rejected for each two-week period of the
experiment is plotted in Fig. 1 for each delay separately for the first
12 weeks of the test. The data from the previous study? is also shown.
Clearly, the rising rejection rate of the previous study was not found;
indeed, there were no rejections at either 600 or 1200 msec for weeks
7 through 12. In the first 12 weeks of the present study, a total of 527 
calls were made over the delay circuits. This compares with 323 calls 
made during the entire earlier study. Thus, it is clear that increasing 
rejection of pure delays of 600 and 1200 msec is not to be expected 
from repeated exposures to these delays only. 

The results during and after the longer delays in the experiment 
are also shown in Fig. 1. Two weeks at 1800 msec resulted in 5 percent 
rejections (3 calls of 60). Two weeks at 2400 msec resulted in 27 percent 
rejections (14 calls of 52). Six different people rejected calls at 2400 
msec. Eighteen of the 20 subjects talked over the 2400-msec delay, 
and these 18 people made 97 percent of the calls over experimental 
circuits during the final 10 weeks when 600- and 1200-msec delays 
were again used on alternate days. Thus, the data following 2400-msec 
delay comes almost entirely from people who had been exposed to 
2400 msec. 

Because of vacations and illnesses there are not enough data during
the final 10 weeks to plot biweekly points. Samples of comparable size 
to the first 12 weeks are obtainable by using two 5-week periods and 
these points are shown in Fig. 1. No additional rejections of the 600- 
msec circuit occurred during this period. Thus, there was no “sensi- 
tization” to that delay. The 1200-msec condition led to 12 percent 
rejection (7 of 57 calls) following exposure to long delays. (Note that 
in Fig. 1 the small difference in percentages between the two final 
periods at 1200 msec is not statistically meaningful.) The occurrence 
of some rejections at 1200 msec following exposure to the longer delays
indicates some sensitization, since 1200 msec had drawn zero rejections in the 6
weeks (94 calls) prior to the long delays.
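
The percentages quoted above follow directly from the call counts given in the text. The short sketch below simply redoes that arithmetic; the dictionary labels and the function name `rejection_percent` are illustrative choices, not part of the original study.

```python
# Rejection counts quoted in the text: (rejected calls, total calls).
conditions = {
    "1800 msec, weeks 13-14": (3, 60),
    "2400 msec, weeks 15-16": (14, 52),
    "1200 msec, weeks 17-26": (7, 57),
}


def rejection_percent(rejected, total):
    """Percentage of calls rejected, rounded to the nearest whole percent."""
    return round(100.0 * rejected / total)


for label, (rejected, total) in conditions.items():
    print(f"{label}: {rejection_percent(rejected, total)} percent")
```

The rounded values agree with the 5, 27, and 12 percent figures reported in the text.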

Telephone interviews were conducted with the 16 subjects who were 
available at the end of the test. They were asked if they, or the people 
with whom they talked, had difficulty in talking or hearing on calls 
within the PBX during the time of the test, or if they noticed anything 
different about any calls. Eight (half) of the respondents said they
had no difficulty and noticed nothing different about their calls. Eight
reported difficulties due to: low volume (2); slow answer (2); fading 
(1); fuzziness (1); noise (1); and another party could not hear (1). 
One person, in addition, reported hearing echo (perhaps due to an air 
path feedback between receiver and transmitter). Of those who re- 
ported difficulty, one rated the condition not objectionable, two mildly 
objectionable, one moderately objectionable, and three seriously ob- 
jectionable. One of those reporting seriously objectionable difficulty 
had never rejected a delay call, but several times dialed “4” on normal 
calls. He said the trouble was loss and dialing “4” did not help. 

The interview data cannot be taken as a reliable measure of the circuit
quality for three reasons: (i) The questions related to a large
population of calls, only a few of which were actually over the delay
circuits, (ii) The test lasted several months, and the subjects could
hardly be expected to sort out accurately and remember all individual
calls, and (iii) The subjects reported difficulties and attempted rejections
on normal calls not involving the delay circuit at all. The interviews
do, however, show that most of the time the participants were
not aware that there was anything different about their connections 
when delays of 600 or 1200 msec were inserted. Indeed, eight people 
reported that they noticed nothing different about their circuits even 
though this group had actually talked on a total of 85 calls with 1800- 
or 2400-msec delay and many more calls at the smaller delays. 

VI. DISCUSSION AND CONCLUSIONS


The first 12 weeks of the present study repeated the procedure of
the previous pure delay tests except that the intermediate exposure 
to 2400 msec was omitted. Under these conditions of exposure to only 
600 and 1200 msec, no increased dissatisfaction with delay circuits 
developed even though a much larger number of delay calls were made 
in the new study. This result alone obviously implies that exposure to 
the 2400-msec delay was responsible for the later rejections of the 600- 
and 1200-msec conditions in the previous study. The results of the 
later weeks of the present study provide limited support for this 
hypothesis, however. After exposure to 1800- and 2400-msec delay, no 
rejections of the 600-msec condition occurred but 12 percent of 
the 1200-msec calls were rejected. In the previous study, after ex- 
posure to 2400-msec delay, the 600-msec condition was rejected in 25 
percent of the calls, and the 1200-msec condition was rejected in 43 
percent of the calls. 

In view of the differences in results between the two studies regard- 
ing the influence of exposure to 2400-msec delay, it might be best to 
withhold judgment about the magnitude of the sensitization effect. 
There is no disagreement, however, on the more direct and important
question of users' reaction to pure round-trip transmission delays of
600 and 1200 msec when these are not confounded with the
longer delays or speech-operated devices. Users are very seldom
disturbed by these pure delays as is indicated by the fact that dur- 
ing the second 6 weeks of the present study the participants completed 
more than 200 calls without a single rejection. 

This conclusion is supported by the field test results” * which show 
for round-trip delays of 600 msec (on 2-wire circuits with echo sources 
and echo suppressors) that less than 1 percent of the people inter- 
viewed immediately after a call over the delay circuit said anything 
which implies an awareness of the delay itself. This is true despite 
the fact that 25 percent or more of the respondents report some difficulty
in talking or hearing on the 600-msec circuit and only half that
many report difficulty on circuits with delays of less than 100 msec
(normal overseas cable circuits).

Although users are not aware of a transmission delay of 600 msec 
it is clear that the delay must affect the conversational patterns in 
such a way as to cause other types of difficulty in actual 2-wire cir- 
cuits, difficulties such as speech mutilation by echo suppressors. In 
addition, there is evidence that for tasks other than naturally occurring
conversation, delays of 600 to 1800 msec may significantly lower
performance (Krauss and Bricker,> Vartabedian*). Studies of speech 
dynamics under various transmission delays are underway to under- 
stand this effect better. 


The Effect of Intersymbol Interference on Error Rate in Binary
Differentially-Coherent Phase-Shift-Keyed Systems


By W. M. HUBBARD 
(Manuscript received March 7, 1967) 


Two types of binary differentially-coherent phase-shift-keyed signals 
(designated AM-DCPSK and FM-DCPSK) which look attractive for 
high-speed digital communication systems are considered. The error rate 
as a function of signal-to-noise ratio is calculated for each type of signal. 
For the AM-DCPSK signal the effects of intersymbol interference from 
adjacent time slots, phase distortion in the pulses, nonideal delay lines 
in the differential phase detector and nonideal regeneration (in a sense 
described in the text) are considered. For the FM-DCPSK signal the effects
of nonideal regeneration and of a degradation parameter $\delta$ are considered.
The parameter $\delta$ can be readily associated with phase distortion and nonideal
delay lines in the differential phase detector. By means of a straightforward
but tedious calculation it can be related to intersymbol interference if the
transfer characteristics of the channel are known. The results of the calcula- 
tions are presented, in graphical form, for wide ranges of signal-to-noise 
ratio and of the parameters which describe the intersymbol interference 
and nonideal regenerator performance. Error rates from 107*° to 10~* are 
considered. 


I. INTRODUCTION 


This paper presents a summary of calculations which were per- 
formed before and during the construction of a 300-Mb/s repeater for 
a guided millimeter-wave communication system. Consequently, the 
problems which are considered are oriented toward problems which 
arise in connection with these high-speed systems, e.g., finite-width 
decision thresholds, imperfect phase shifts in the modulators, etc. 
Because of the nature of the channels envisioned for these systems, only 
intersymbol interference from adjacent time slots is significant and the
treatment of intersymbol interference will include only adjacent time- 
slot interference. 


1.1 Summary of Previous Results 

Several authors!*:*4 have calculated the error rate as a function of 
signal-to-noise ratio, S/N, for an ideal differentially-coherent phase- 
shift-keyed (DCPSK) system, i.e., a system in which intersymbol 
interference can be ignored and in which regeneration is assumed to be 
ideal. The well-known result is 


$$\Pi_0 = \tfrac{1}{2} \exp(-S/N), \tag{1}$$

where $\Pi_0$ is the probability of error for the ideal case. The author® has
considered the effects of nonideal regeneration. Sections II and III of 
this paper extend these error-rate calculations to include the effects of 
intersymbol interference for two types of DCPSK signals. These 
signals are discussed and compared in the next paragraph. 
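
As a quick illustration of the ideal result (1), the sketch below evaluates one half of exp(-S/N) for a signal-to-noise ratio supplied in decibels. The function name `ideal_dcpsk_error_rate` is a hypothetical label, and the conversion assumes that S/N in (1) is a power ratio rather than a value in dB.

```python
import math


def ideal_dcpsk_error_rate(snr_db):
    """Equation (1): error probability of ideal DCPSK for S/N given in dB."""
    snr = 10.0 ** (snr_db / 10.0)  # convert decibels to a power ratio
    return 0.5 * math.exp(-snr)


for snr_db in (6, 8, 10, 12):
    print(f"S/N = {snr_db:2d} dB  ->  error rate = {ideal_dcpsk_error_rate(snr_db):.3e}")
```

Because the error rate falls exponentially with the power ratio, a change of a few decibels moves the result by several orders of magnitude.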


1.2 Comparison of AM-DCPSK and FM-DCPSK Signals 


In a DCPSK system the information is carried in the phase of the 
signal at the sampling point in one time slot relative to the phase of 
the signal at a time $T$ earlier, where $T$ is the reciprocal of the bit
rate $B$. This phase change can be accomplished in several ways; the
effect of intersymbol interference depends on how it is accomplished. 
In the following, two types of modulation which can be thought of as 
limiting cases (in a sense that should become clear in the following 
discussion) will be considered. 

The first type to be considered consists of a sequence of amplitude 
modulated RF pulses occurring at the bit rate, B. The information is 
carried in whether the relative carrier phase between adjacent pulses 
is 0 or $\pi$. Since a phase shift of $\pi$ radians is equivalent to a change in
sign, the signal can be written in the form

$$S(t) = \sum_n a_n S_0(t - nT) \exp(j\omega_0 t), \tag{2}$$

where $a_n = +1$ or $-1$ according to whether the phase in the $n$th time
slot is 0 or $\pi$, respectively. $S_0(t - nT)$ is a pulse-shaping term which
reaches its maximum value at $t = nT$. Intersymbol interference arises
from the fact that $S_0(t - nT)$ is not confined to a single time slot.
Fig. 1(b) is an example of this type of signal. 

Fig. 1 — (a) Unmodulated IF-carrier for AM-DCPSK; (b) idealized AM-DCPSK
signal; (c) unmodulated IF-carrier for FM-DCPSK; (d) idealized FM-DCPSK
signal carrying the same information as in (b); (e) frequency vs time
for the signal in (d).

Since the signal $S(t)$ in (2) is in fact a pure AM signal (even though
the information is recovered by comparing phases) this type of modulation
can be designated AM-DCPSK in order to distinguish it from a
second type which will be described below. Error rate as a function of 
S/N for an AM-DCPSK signal with intersymbol interference and non- 
ideal regeneration is calculated in Section II. 

The second type of modulation consists of a constant amplitude 
signal which shifts phase between sampling points by means of a 
frequency swing. This signal, which can be designated FM-DCPSK, 
can be written in the form

$$S(t) = \exp\left\{ j\left[\omega_0 t + \int_0^t \omega(t')\,dt'\right]\right\}, \quad \text{where} \quad \int_{(n-1)T}^{nT} \omega(t)\,dt = a_n, \tag{3}$$

and $a_n$ is a chance binary variable (which contains the information
being transmitted) that can take on any two values which differ by $\pi$.
In practice, the values $+\pi/2$ and $-\pi/2$ offer certain advantages so the
following discussion will assume, for clarity, that $a_n = \pm\pi/2$. This
assumption is unnecessary for the calculation and does not influence 
the result in any way. An example of this type of signal is given in 
Fig. 1(d). 

Error-rate vs S/N for an FM-DCPSK signal with intersymbol 
interference and nonideal regeneration is calculated in Section III. 


1.3 The Differential Phase Detector 


Both AM-DCPSK and FM-DCPSK signals can be detected using a 
product demodulator of the type shown in Fig. 2. For this device to 
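function properly, the intermediate frequency $f_0$ and the bit rate, $B$,
must be related according to

$$f_0 = mB, \quad m = 1, 2, 3, \ldots \quad \text{for AM-DCPSK} \tag{4}$$

and

$$f_0 = \tfrac{1}{2}\left(m + \tfrac{1}{2}\right)B, \quad m = 1, 2, 3, \ldots \quad \text{for FM-DCPSK}. \tag{5}$$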


Fig. 2 — Differential phase detector.

When these conditions are satisfied for the appropriate type of modulation,
the output of the differential phase detector will be proportional
to $V$, where

$$V = \tfrac{1}{4}\left[(\mathbf{u} + \mathbf{v})^2 - (\mathbf{u} - \mathbf{v})^2\right] = uv \cos\psi. \tag{6}$$

Here $\mathbf{u}$ is a vector which represents the amplitude and phase of the
signal at time $t$, $\mathbf{v}$ the amplitude and phase at time $t - T$, and $\psi$ is the
phase difference between times $t$ and $t - T$.
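
The sum-and-difference form of the detector output can be checked numerically by treating the two vectors as complex phasors. The sketch below assumes the normalization in which one quarter of the difference of squared magnitudes equals the product of the amplitudes times the cosine of the phase difference; the function name `detector_output` and the sample amplitudes are illustrative only.

```python
import cmath
import math


def detector_output(u, v):
    """One quarter of |u + v|^2 - |u - v|^2 for complex phasors u and v."""
    return 0.25 * (abs(u + v) ** 2 - abs(u - v) ** 2)


# Example phasors: amplitudes 1.3 and 0.8, phase difference psi = 2.0 rad.
psi = 2.0
u = cmath.rect(1.3, 0.4 + psi)   # signal at time t
v = cmath.rect(0.8, 0.4)         # signal at time t - T

print(detector_output(u, v))       # sum-and-difference form
print(1.3 * 0.8 * math.cos(psi))   # amplitude product times cos(psi)
```

Both lines print the same value up to rounding, which is why square-law detection of the sum and difference ports recovers the cosine of the phase difference.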


1.4 The Regenerator 


The output of the differential phase detector is fed into the regenera- 
tor where it is sampled at a particular point in each time slot. An ideal 
regenerator would regenerate a $+1$ if $\cos\psi > 0$ and a $-1$ if
$\cos\psi < 0$ regardless of how small $|\cos\psi|$ might be. Since no realizable
regenerator will accomplish this, we take as a model of a regenerator a
device which regenerates $+1$'s and $-1$'s according to the following
inequalities:

$$\begin{aligned}
+1 \quad &\text{if } 1 \ge \cos\psi \ge \epsilon \\
-1 \quad &\text{if } -\epsilon \ge \cos\psi \ge -1 \\
+1 \text{ or } -1 \quad &\text{randomly and with equal probability if } |\cos\psi| < \epsilon.
\end{aligned} \tag{7}$$

It is convenient to define a threshold $T$ in terms of the parameter $\epsilon$
by means of the relation

$$S/T = -20 \log \epsilon \ \text{dB}.$$


The quantity S/T is the so-called signal-to-threshold ratio and repre- 
sents the ratio of the expected value of signal power to the minimum 
value of signal power which will cause the regenerator to function 
reliably (in the absence of noise). 

In high-speed systems the signal-to-threshold ratio is limited by 
practical considerations (at the present state of the art) to values of 
the order of 10 dB or less. 
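
The regenerator model of paragraph 1.4 can be sketched in a few lines. This is only an illustration of the decision rule: the function names are hypothetical, the logarithm in the signal-to-threshold relation is taken to be base 10 (as the dB unit suggests), and the random choice inside the dead zone uses Python's standard generator.

```python
import random


def epsilon_from_st_db(st_db):
    """Invert S/T = -20 log10(epsilon) dB to recover the threshold parameter."""
    return 10.0 ** (-st_db / 20.0)


def regenerate(cos_psi, epsilon, rng=random):
    """Decision rule of the model: +1, -1, or a fair coin flip in the dead zone."""
    if cos_psi >= epsilon:
        return +1
    if cos_psi <= -epsilon:
        return -1
    return rng.choice((+1, -1))


eps = epsilon_from_st_db(10.0)       # S/T of 10 dB
print(round(eps, 4))                 # dead-zone half width in units of cos(psi)
print(regenerate(0.9, eps), regenerate(-0.9, eps), regenerate(0.1, eps))
```

With S/T near 10 dB the dead zone covers roughly a third of the range of the detector output, which is why a finite threshold matters at the signal-to-threshold ratios quoted above.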


II. ERROR-RATE WITH AM-DCPSK MODULATION


2.1 Intersymbol Interference 


The voltages at the two output ports of the second quadrature 
hybrid in Fig. 2 will at any instant consist of contributions from the 
following sources: 


(i) The two pulses being compared.
(ii) Intersymbol interference from other pulses in the channel.
(iii) Interchannel interference from other channels propagating in
the medium.
(iv) Noise.


The following assumptions are made: 


(i) The noise is Gaussian and the noise on adjacent pulses is
statistically independent.*
(ii) Interchannel interference is negligible.
(iii) Intersymbol interference comes only from adjacent pulses.
(iv) The sampling is accomplished instantaneously.


Since the phases of these output voltages are, in general, different 
we must represent these voltages by vectors (in a plane). Consider four 
adjacent pulses labeled LABR from left to right. The pulses labeled 
A and B are the two whose phases are to be compared. The ones labeled 
L and R are significant because they contribute to the intersymbol 
interference. Let L, A, B, and R be vectors of unit magnitude which 
lie along the $+X$ or $-X$ direction. Let $\rho_r$ represent the ratio of $S_0(T)$
to $S_0(0)$ and $\rho_a$ the ratio of $S_0(-T)$ to $S_0(0)$ where $S_0(t)$ is the quantity
introduced in (2). Let a and b represent the noise on pulses A and B,
respectively.

The outputs $r_1$ and $r_2$ of the two output ports of the second quadrature
hybrid are then

$$r_1 = S + a + b = u + v \tag{8}$$

$$r_2 = D + a - b = u - v, \tag{9}$$

respectively, where

$$S = A + B + \rho_a(B + R) + \rho_r(L + A) \tag{10}$$

$$D = A - B + \rho_a(B - R) + \rho_r(L - A). \tag{11}$$


For a given pulse pattern S and D are determined. The quantities a and 
b represent four independent Gaussian variables with zero means and 
equal variances. 

Consider first the means of the distributions of $r_1$ and $r_2$. Since a and
b are Gaussian variables of zero mean, these means are determined for
a given pulse pattern by S and D, respectively.

There are $2^4 = 16$ possible patterns for the four pulses, L, A, B, and
R. Of these, eight can be obtained by reversing all of the phases in
each of the other eight patterns. Since such a sign reversal has no
effect on the error probabilities to be considered, only eight patterns
need be considered. These are enumerated in Table I.

TABLE I

Case    L    A    B    R    $\tfrac12 S$              $\tfrac12 D$              $\tfrac14(S^2 - D^2) = \tfrac14 P P_d$ (neglecting terms in $\rho^2$)
1      +1   +1   +1   +1    $1 + \rho_a + \rho_r$     $0$                       $1 + 2\rho_a + 2\rho_r$
2      +1   +1   +1   -1    $1 + \rho_r$              $\rho_a$                  $1 + 2\rho_r$
3      -1   +1   +1   +1    $1 + \rho_a$              $-\rho_r$                 $1 + 2\rho_a$
4      -1   +1   +1   -1    $1$                       $\rho_a - \rho_r$         $1$
5      +1   +1   -1   +1    $\rho_r$                  $1 - \rho_a$              $-1 + 2\rho_a$
6      +1   +1   -1   -1    $-\rho_a + \rho_r$        $1$                       $-1$
7      -1   +1   -1   +1    $0$                       $1 - \rho_a - \rho_r$     $-1 + 2\rho_a + 2\rho_r$
8      -1   +1   -1   -1    $-\rho_a$                 $1 - \rho_r$              $-1 + 2\rho_r$

* The latter assumption is never strictly true since the noise is band-limited.
It has been shown,* however, that the effects of this correlation on error rate are
negligible unless the noise bandwidth is smaller than about 1.4 times the bit-rate.
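
The entries of Table I can be regenerated by enumerating the eight pulse patterns with A fixed at +1 and evaluating (10) and (11) directly. In the sketch below the names `rho_a` (the coefficient of B + R) and `rho_r` (the coefficient of L + A) are labels assumed for the two tail ratios, and the numerical values are arbitrary examples.

```python
from itertools import product


def table_one(rho_a, rho_r):
    """Return rows (case, L, A, B, R, S/2, D/2, (S^2 - D^2)/4) with A = +1."""
    rows = []
    A = 1
    # Loop order (B, L, R) reproduces the case numbering used in Table I.
    for case, (B, L, R) in enumerate(product((1, -1), repeat=3), start=1):
        S = A + B + rho_a * (B + R) + rho_r * (L + A)   # equation (10)
        D = A - B + rho_a * (B - R) + rho_r * (L - A)   # equation (11)
        rows.append((case, L, A, B, R, S / 2, D / 2, (S * S - D * D) / 4))
    return rows


for row in table_one(rho_a=0.10, rho_r=0.05):
    print("case %d  L=%+d A=%+d B=%+d R=%+d  S/2=%+.3f  D/2=%+.3f  "
          "(S^2-D^2)/4=%+.4f" % row)
```

The last column is computed exactly rather than to first order in the tail ratios, so small second-order differences from the tabulated expressions are expected. For these example values the pattern with the smallest magnitude in the last column is Case 7, in line with the worst-case discussion that follows.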

We now consider the output of the differential phase detector for 
an arbitrarily chosen pulse pattern. The criterion for making the 
decision as to whether the pulses are of the same phase or of opposite 
phase is that of determining whether 

$$V \equiv \left\{ |r_1|^2 - |r_2|^2 \right\} \tag{12}$$
is positive or negative. This criterion is equivalent to deciding whether 
the phase angle between the received pulses (including crosstalk and 
noise) is less than 90 degrees or greater than 90 degrees, respectively. 
It is worth mentioning that due to the correlation between the signals 
in the sum and difference arms 0 would in general not be the proper 
decision level if the phases of the pulse tails relative to the pulse peaks 
were fixed and known. Since this is probably not going to be the case 
in any reasonable system, the decision level will be taken at 0 in 
this calculation. (Zero is the optimum decision level for random phase 
in the tails.) 

With the assumptions that have been made, the error probability 
including the effects of intersymbol interference can be determined by 
a straightforward extension of the calculation due to Bennett and 
Salz.? Substituting (8) and (9) into (12) gives 


$$V = (P + x)(P_d + x_d) + y\,y_d, \tag{13}$$

where

$$P = S + D, \qquad P_d = S - D,$$

$$x = 2a_x, \qquad x_d = 2b_x,$$

$$y = 2a_y, \qquad y_d = 2b_y.$$

Equation (13) is identical in form to (54) of Ref. 3. However, the
symbols now include the effects of intersymbol interference. Following
the method of Bennett and Salz* one obtains for the probability of
error 


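$$\Pi = \frac{1}{\pi} \int_0^{\pi/2} \exp\left[ \frac{-P^2 P_d^2}{8\sigma^2 \left(P_d^2 \cos^2\theta + P^2 \sin^2\theta\right)} \right] d\theta. \tag{14}$$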


It must be recalled that $P$ and $P_d$ are pulse-pattern dependent. For a
random message the error rate is obtained by averaging the expression
in (14) over the eight possible values of $P$ and $P_d$.

In order to determine the worst possible error rate for any arbitrary
message, we can evaluate $\Pi$ for the worst pulse pattern. From Table
I it is apparent that this is Case 7 if there is no phase distortion in the
pulse tails. In any event this case represents an upper limit on the error
rate. Fortuitously, since $S = 0$ in this case, $P^2$ is equal to $P_d^2$ and the
integral for $\Pi$ becomes particularly simple.


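$$\Pi = \frac{1}{\pi} \int_0^{\pi/2} \exp\left(-\frac{P^2}{8\sigma^2}\right) d\theta = \frac{1}{2} \exp\left[-\frac{(1 - \rho_a - \rho_r)^2}{2\sigma^2}\right]. \tag{15}$$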


Thus, the effects of intersymbol interference can be treated, for the 
worst pulse pattern at least, as a degradation of the signal-to-noise 
ratio by the amount 


$$20 \log (1 - \rho_a - \rho_r) \ \text{dB}.$$
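
A numerical sketch of the pattern-dependent error rate and of the worst-case degradation is given below. It rests on an assumption about normalization: the integrand is taken as the exponential of minus the squared product of P and P_d divided by eight times the noise variance times the weighted sum of squares, with sigma squared the variance of each noise component, so that the result reduces to (1) when S/N equals one over twice sigma squared. The function names and the sample values are illustrative only.

```python
import math


def pattern_error_rate(P, Pd, sigma, steps=2000):
    """Midpoint-rule estimate of (1/pi) times the integral over [0, pi/2]."""
    h = (math.pi / 2) / steps
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * h
        denom = 8.0 * sigma ** 2 * (Pd ** 2 * math.cos(theta) ** 2
                                    + P ** 2 * math.sin(theta) ** 2)
        total += math.exp(-(P * Pd) ** 2 / denom)
    return total * h / math.pi


def worst_case_degradation_db(rho_a, rho_r):
    """Signal-to-noise degradation for the worst pattern (Case 7), in dB."""
    return 20.0 * math.log10(1.0 - rho_a - rho_r)


rho_a = rho_r = 0.05
sigma = 0.3
D = 2.0 * (1.0 - rho_a - rho_r)          # Case 7: S = 0, so P = D and P_d = -D
print(pattern_error_rate(D, -D, sigma))  # numerical integral
print(0.5 * math.exp(-(1.0 - rho_a - rho_r) ** 2 / (2.0 * sigma ** 2)))
print(worst_case_degradation_db(rho_a, rho_r))
```

For Case 7 the integrand is constant, so the first two printed values coincide; tails of 0.05 each cost a little under 1 dB of signal-to-noise ratio.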


The extension of this calculation to the case where both intersymbol 
interference and a finite-width decision threshold’ are present is 
straightforward. One replaces the integral in (90) of Ref. 3: 


[ de i: [ p(—z | x, y)g(@, y) dx dy, 


by the sum of two integrals: 
1 4e Ce) ee) 
2 a - i, [ p(—2z | 2, y)g(a, y) dx dy 


ae or 


These integrals are then evaluated by the method used in Appendix A
of Ref. 5. Figs. 3, 4, and 5 show error rate as a function of signal-to-noise
ratio in the case where $\rho_a$ equals $\rho_r$ in terms of signal-to-threshold
ratio and crosstalk per tail. In these figures $\rho_T = \rho_a = \rho_r$ is expressed
in dB.
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SIGNAL -TO-NOISE RATIO IN DECIBELS 


Fig. 3— Probability of error vs signal-to-noise ratio for S/T = 15 dB AM-DCPSK, 
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Fig. 4 — Probability of error vs signal-to-noise ratio for S/T = 9 dB AM-DCPSK. 
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Fig. 5 —~ Probability of error vs signal-to-noise ratio for S/T = 6 dB AM-DCPSK. 
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The scale of the ordinate in Figs. 3, 4, and 5 is chosen such that a 
plot of I) vs S/N from (1) gives a straight line. One finds from in- 
spection of Figs. 3, 4, and 5 that even in the presence of intersymbol 
interference and nonideal regeneration, this linearity persists. The 
slope of the line does change, however. Fig. 6 shows the slope, p, of 
these lines as a function of S/T for various values of pr. Here the 
slope, y, is defined such that » = 1 for the ideal case [see (1)]. 


2.2 Phase Distortion 


In an AM-DCPSK system there are two important types of phase 
distortion which are readily treated by a modification of the foregoing 
calculation. One type is representable by a phase shift of the pulse in
the $n$th time slot by $\theta$ degrees relative to the phase which should have
been transmitted in that time slot. In this case the vectors B and $\rho_a$B
in (10) and (11) are rotated an amount $\theta$. This could arise, for example,
from an improperly balanced pulse modulator. In the other type of
phase distortion all of the delayed pulses in the differential phase detector
are shifted an amount $\varphi$ relative to their proper value. In this
case the vectors B, $\rho_r$A, and $\rho_a$R are rotated an amount $\varphi$. This could
arise, for example, from a delay line of improper length in the differential
phase detector.

The analysis of these effects constitutes a straightforward extension
of the calculation in paragraph 2.1 and only the results will be given
here. Fig. 7 shows the degradation in error-rate performance which
results from a phase shift $\theta$ for $\rho_T = -\infty$, $-26$, and $-20$ dB. This effect
is virtually independent of S/T for S/T > 6 dB. Fig. 8 shows the
degradation for phase shift $\varphi$. The result is virtually independent of
S/T for S/T > 6 dB and of $\rho_T$ for $\rho_T < -20$ dB.

Fig. 6 — Slope, $\mu$, of error probability vs S/N curve as a function of S/T.

Fig. 7 — Degradation in signal-to-noise ratio due to a phase shift, $\theta$, in one of
the received pulses.


III. ERROR-RATE IN AN FM-DCPSK SYSTEM 


In an FM-DCPSK system, it is useful to include a limiter in the 
receiver after the noise has been added. Therefore, the following calcu- 
lation assumes that an ideal limiter is used. By ideal limiter is meant 
a device which receives at its input the signal

$$A(t) \exp[j\varphi(t)]$$

and at its output delivers the signal

$$A_0 \exp[j\varphi(t)].$$

Since the amplitudes $u$ and $v$ in (6) are constants after limiting, the
output of the differential phase detector is simply

$$V = \cos\psi$$

and the error probability is directly related to the probability density
of $\psi$.

Fig. 8 — Degradation in signal-to-noise ratio due to improper delay line length.

The intersymbol interference is assumed to manifest itself in the 
form of a perturbation in the phase of the signal at the sampling time. 
More precisely, the intersymbol interference (in this model) introduces 
a (pattern dependent) phase shift $\delta_n$ so that the phase change in the
$n$th time slot is $\pm(\pi/2 + \delta_n)$ instead of $\pm\pi/2$.

The value of $\delta_n$ (for each distinct pulse pattern) depends on the
details of the signal waveform and the transfer function of the devices
in the system. The determination of the $\delta_n$'s for particular systems is
beyond the scope of this paper. For a discussion of this problem see, for
example, Rice and Bedrosian.? This paper concerns itself with the
effect on error rate of a particular value of $\delta_n$. In order to apply these
results to the performance of a particular system, one needs to compute
the values of $\delta_n$ for the various pulse patterns and then average
these computed error rates over the possible pulse patterns.

The parameter $\delta$ can also be used to investigate the effects of phase
distortion and nonideal delay lines in the differential phase detector
just as $\theta$ and $\varphi$ were used in the AM-DCPSK case. One need only
associate $\delta$ with the total (net) phase shift for these distortions (including
that due to intersymbol interference).

Let $\alpha_0$ and $\beta_0$ represent the phase shift due to intersymbol interference
(and any other degradation in phase) on the two pulses being
compared. This situation is represented by the phasor diagram in
Fig. 9. Following the method described in Ref. 5, one obtains for the
probability density function of $\psi$


p(y) = 3 exp (—1/o!) + © exp (—1/20") 


aw/2 
co bes cos (a — a) cos (a + ¥ — Bo) 
sae | SiGe = a) sin’ + ¥ ~ 82] da 


a/2 


1 
+ ga [cos @- a) cos (a + v — Bo) 





— | - (a — a) + si (aty— a | 


f COS (a = a) f Cos (a + ¥ — Bo) 7 1 
er ae er a a (16) 





Fig. 9— Phasor diagram for the differential detection process. 

From the regenerator model (Paragraph 1.4) one observes that an
error is made if $\cos\psi \ge \epsilon$ and there is a 50-percent probability of error
if $|\cos\psi| < \epsilon$. The error probability, $\Pi$, is therefore given for a transmitted
signal which should result in $\psi = \pi$ by

$$\Pi = \int_{-\theta}^{\theta} p(\psi)\,d\psi + \frac{1}{2}\int_{\theta}^{\pi-\theta} p(\psi)\,d\psi + \frac{1}{2}\int_{-\pi+\theta}^{-\theta} p(\psi)\,d\psi, \tag{17}$$

where

$$\theta = \cos^{-1}\epsilon, \qquad 0 \le \theta \le \pi/2.$$
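Substituting (16) into (17) gives, after some simplification,

$$\Pi = \frac{1}{2} - \frac{1}{4\pi\sigma^2}\int_{-1}^{1}\int_{y_l(x)}^{y_u(x)} \exp\left(-\frac{x^2+y^2}{2\sigma^2}\right) dy\,dx, \tag{18}$$

$$y_u(x) = \sqrt{1-x^2}\,\cos(\varphi-\delta) + x\sin(\varphi-\delta)$$

$$y_l(x) = -\sqrt{1-x^2}\,\cos(\varphi+\delta) + x\sin(\varphi+\delta)$$

where

$$\delta = \alpha_0 - \beta_0, \qquad \varphi = \frac{\pi}{2} - \theta = \sin^{-1}\epsilon.$$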

Fig. 10 — Region of integration in (18).

Now $y_u(x)$ and $y_l(x)$ are segments of (different) ellipses both of
which have the following properties: They are centered at the origin,
have their major axes along the line $x = y$, and are tangent to the
lines $x = 1$, $x = -1$, $y = 1$, and $y = -1$. A typical pair of such
ellipses is shown in Fig. 10. By symmetry the small area A in Fig. 10(a)
is equal to the small area B. Therefore, due to the spherical symmetry
of the integrand, the integration indicated by the limits $-1 \le x \le 1$,
$y_l(x) \le y \le y_u(x)$ [as shown by the shaded portion of Fig. 10(a)]
can be replaced by the limits which correspond to integrating over the
region bounded by the upper ellipse and the line $\theta = \Theta$ and the region
bounded by the lower ellipse and this same line [see Fig. 10(b)].
Writing the integral in polar coordinates, one obtains a form which is
easily integrated over $r$. When this is done, the following result is
obtained:


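$$\Pi = \frac{1}{4\pi}\left[\int_{\Theta}^{\Theta+\pi} \exp\left\{\frac{-\cos^2(\varphi-\delta)}{2\sigma^2\left[1-\sin(\varphi-\delta)\sin 2\theta\right]}\right\} d\theta + \int_{\Theta+\pi}^{\Theta+2\pi} \exp\left\{\frac{-\cos^2(\varphi+\delta)}{2\sigma^2\left[1-\sin(\varphi+\delta)\sin 2\theta\right]}\right\} d\theta\right]. \tag{19}$$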


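But these integrals are periodic in $\theta$ with period $\pi$; therefore, the $\Theta$'s can be
deleted. The error rate can then be written

$$\Pi = \tfrac{1}{2} P_0(\varphi + \delta) + \tfrac{1}{2} P_0(\varphi - \delta), \tag{20}$$

where

$$P_0(\Phi) = \frac{1}{2\pi}\int_{-\pi/2}^{\pi/2} \exp\left\{\frac{-\cos^2\Phi}{2\sigma^2\left[1-\sin\Phi\sin 2\theta\right]}\right\} d\theta. \tag{21}$$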

This integral is not soluble in closed form. The integral

$$P_a(\Phi) = \frac{1}{2\pi}\int_{-\pi/2}^{\pi/2} \exp\left\{-\frac{\cos^2\Phi}{2\sigma^2}\left[1+\sin\Phi\sin 2\theta\right]\right\} d\theta$$

is soluble and is a good approximation to $P_0(\Phi)$ over a wide range of
values of $\sigma$ and $\Phi$. It can be written

$$P_a(\Phi) = \frac{1}{2} \exp\left(-\frac{\cos^2\Phi}{2\sigma^2}\right) I_0\left(\frac{\cos^2\Phi\,\sin\Phi}{2\sigma^2}\right),$$

where $I_0$ is the modified Bessel function of the first kind. A complete
consideration of the accuracy of this approximation is quite tedious
and will not be considered further because $P_0(\Phi)$ itself is so readily
obtained by numerical integration of (21).

Fig. 11 shows $P_0(\Phi)$ for several values of S/N. Figs. 12, 13, 14, and
15 show $\Pi(S/N)$ for S/T = $\infty$, 12, 9, 6 dB, respectively, for $\delta$ = 0,
5, 10, and 15 degrees.


1166 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1967 


[Plot: probability of error vs Φ in degrees, two panels.]

Fig. 11 — Numerical evaluation of the error-rate integral.

Fig. 12 — Probability of error vs signal-to-noise ratio for S/T = ∞, FM-DCPSK.

Fig. 13 — Probability of error vs signal-to-noise ratio for S/T = 12 dB, FM-DCPSK.
APPENDIX
Description of the Operation of the Differential Phase Detector for an 
FM-DCPSK Signal 


The differential phase detector is shown in Fig. 2. For an FM- 
DCPSK signal with no noise or distortion the input is given by (3). 
One can readily show that the signals in the output ports of the 
second 3-dB quadrature hybrid are given by 


DG) 8 oes {0 Ee if ; w(t’) aw ia eas jont ae yee a . wo(") aw} 
Bit) = Fsin {ex + fe w(t’) aw’ — isin {ou —+ = w(t’) aw. 


If the detectors, mounted as shown in Fig. 2, are regarded as having
square-law behavior, the output is given by

$$V(t) \propto B^2(t) - A^2(t) \propto \cos\left\{\omega_0\tau + \int_{t-\tau}^{t}\omega(t')\,dt'\right\} + \text{terms in } 2\omega_0 t.$$

The terms in $2\omega_0 t$ are removed by the low-pass filters. We can thus
write the basic equation of the differential phase detector for an
FM-DCPSK signal as

$$V(t) = \cos\left\{\omega_0\tau + \int_{t-\tau}^{t}\omega(t')\,dt'\right\}. \tag{22}$$

For use as a differential phase detector one chooses $\tau = T$ = the
reciprocal of the bit rate, and an IF such that $\omega_0 T = (m + \tfrac{1}{2})\pi$, where $m$
can be any integer. This equation then becomes

$$V(t) = \sin\left\{\int_{t-T}^{t}\omega(t')\,dt'\right\},$$

which by (3) must be

$$V(t) = \sin\alpha_n = \pm 1$$

at the sampling points $t = nT$.
Experimental Verification of the Error-Rate
Performance of Two Types of Regenerative 
Repeaters for Differentially Coherent 
Phase-Shift-Keyed Signals 


By W. M. HUBBARD and G. D. MANDEVILLE 
(Manuscript received February 9, 1967) 


High-speed digital repeaters are being considered in the Bell System 
and elsewhere for both long- and short-haul communication systems. This
paper describes two devices which were built to serve as prototypes of the 
IF portion of a millimeter guided-wave communication system but which 
might serve equally well as the IF portion of the repeaters for an optical 
communication system or for microwave radio systems. These two proto- 
types, designed for a differentially coherent phase-shift-keyed (DCPSK) 
signal, have been built and operated at a bit rate of 160 Mb/s using an 11.2- 
GHz IF signal. Both models operated with error rates very close to those 
predicted theoretically. One of the models seems to be particularly suitable 
for such systems; it consists of comparatively simple circuitry, and its 
operation is within 0.5 dB of the theoretical ideal behavior for a DCPSK 
system. 


I. INTRODUCTION 

High-speed long-haul communication by means of millimeter waves 
transmitted in the circular electric mode through a multimode circular 
waveguide was described by S. E. Miller¹ in 1954, and subsequently
considered in some detail by Rowe and Warters.² This paper describes
two experimental models of the IF portion of a repeater for such a 
system and compares their performance with theoretical predictions 
of error rate. These models could serve equally well as IF sections of 
repeaters for optical communication systems or microwave radio 
systems. 

No attempt is made in these models to equalize the delay distortion
of the medium, and no allowance is made for degradation from up 
converters, down converters, and millimeter-wave circuitry. 
Both experimental model repeaters operated at a bit rate of 160
Mb/s and used an IF of 11.2 GHz. The bit rate and the IF were arbi- 
trary choices made for convenience. In an actual system, a somewhat 
lower IF would be chosen to facilitate building solid-state amplifiers 
and a somewhat higher bit rate would probably be desirable. Since 
these model repeaters were to serve as prototypes for an even higher 
bit rate repeater, no components were used which did not seem capable 
of being modified to operate up to about twice this bit rate. 

Section II discusses briefly the two particular choices of modulation 
scheme which were used, and describes some of the features common 
to both models. Section III describes the so-called AM-DCPSK 
repeater and Section IV the FM-DCPSK one. 

These models demonstrated that a repeater which performs with 
error rates quite close to those predicted theoretically can be built. 


II. DIFFERENCES AND SIMILARITIES OF THE TWO MODELS 


The final version of a millimeter-wave repeater would almost cer- 
tainly be an all solid-state system. This requirement imposes a limita- 
tion on the maximum power available, especially at millimeter-wave 
frequencies. Because of this power limitation, which generally mani- 
fests itself as a limitation on peak power, it is necessary to use a 
modulation scheme which affords good noise immunity. Optimum 
noise immunity (in a binary system) is obtained when the two signal 
states are anticorrelated.³ This corresponds to coherent phase modulation
where the two signals have identical envelopes (consistent with
the power limitations) and differ in phase by $\pi$ radians. Such a system
operates by sending pulses with phase either 0 or $\pi$ relative to
some standard reference phase. With a system of this type, it is neces- 
sary to provide this standard phase at each repeater in order that 
regeneration may be performed. An alternative approach is differen- 
tially-coherent phase-shift-keying (DCPSK). In DCPSK, the phase 
of each pulse is used as the reference for determining the phase of the 
next following pulse. The information is, therefore, carried in the 
relative phase or, equivalently, it is carried in whether or not the 
phase changes between pulses. The penalty in noise immunity for 
using a DCPSK system instead of a coherent PSK system amounts to
about 2 dB for error rates of about 0.01 but decreases with increasing
S/N. For signal-to-noise ratios which give error rates of the order
of 10⁻⁹ (the assumed acceptable error rate for this experiment) the
degradation in noise immunity is less than 0.5 dB. (See, for example,
Lawton.⁴) Since the system under consideration is to be operated with
error rates of this order, the only form of modulation to be considered 
in this paper is DCPSK. 
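
The size of this penalty can be checked numerically. The sketch below is illustrative only: it assumes the standard textbook error-rate expressions for ideal binary coherent PSK, $\tfrac{1}{2}\,\mathrm{erfc}\sqrt{\gamma}$, and ideal binary DCPSK, $\tfrac{1}{2}e^{-\gamma}$, in white Gaussian noise, with $\gamma$ the signal-to-noise ratio per bit. These expressions, the function names, and the bisection bounds are assumptions made here and are not taken from the experiment.

```python
import math

def coherent_psk_error(snr):
    """Error probability of ideal coherent binary PSK (snr is linear, per bit)."""
    return 0.5 * math.erfc(math.sqrt(snr))

def dcpsk_error(snr):
    """Error probability of ideal binary DCPSK (snr is linear, per bit)."""
    return 0.5 * math.exp(-snr)

def required_snr_db(error_fn, target, lo=1e-3, hi=100.0):
    """Bisect for the S/N, in dB, at which error_fn equals the target rate."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if error_fn(mid) > target:
            lo = mid  # error rate still too high, so more S/N is needed
        else:
            hi = mid
    return 10.0 * math.log10(0.5 * (lo + hi))

def dcpsk_penalty_db(target):
    """Extra S/N (dB) that DCPSK needs over coherent PSK at a given error rate."""
    return (required_snr_db(dcpsk_error, target)
            - required_snr_db(coherent_psk_error, target))

if __name__ == "__main__":
    for target in (1e-2, 1e-9):
        print(f"error rate {target:g}: penalty {dcpsk_penalty_db(target):.2f} dB")
```

Under these assumptions the penalty shrinks as the target error rate is lowered, which is the trend described above.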

A block diagram of a typical repeater is shown in Fig. 1. The pur- 
pose of this figure is to show the overall layout of the repeater and to 
indicate where the IF portion of the repeater fits in. Figs. 2 and 3 
are block diagrams of the two experimental model repeaters which this 
paper describes, along with the test equipment used in the experiments. 
The portions of these figures contained within the dotted lines are 
more detailed versions of the single block labeled “IF portion” in 
Fig. 1. 

The two model repeaters used somewhat different modulation 
schemes to achieve the binary DCPSK. In binary DCPSK, the infor- 
mation is carried in the phase change of a signal between two sampling 
points. DCPSK signals can have several forms since the only require- 
ment is that the phase must make either of two prescribed changes 
between each pair of adjacent sampling points. Two somewhat ideal- 
ized classes of signals are considered below. These two classes represent 
limiting cases since any physically realizable signal would contain some 
of the properties of each. We designate these classes as AM-DCPSK 
and FM-DCPSK. The AM-DCPSK signals are created (at least conceptually)
by generating a separate pulse for each time slot with one
or the other of two phases. Each pulse is thought of as being generated 
independently of pulses in other time slots, but is not necessarily 
confined to a single time slot, i.e., intersymbol interference is allowed. 
As an example, such a signal might have the form 


$$S(t) = \sum_n S_0(t - nT)\,\exp[i(\omega t + \alpha_n)], \tag{1}$$

where $S_0(t)$ is the pulse-shaping factor which is slowly varying compared
with the exponential factor, and $\alpha_n$ is a chance variable which
can take with equal probability either of two values which differ by $\pi$.
The information is carried in whether $|\alpha_{n-1} - \alpha_n|$ equals 0 or $\pi$. In
such a system, the amplitude is varied, but the carrier frequency of
the individual pulses, $\omega/2\pi$, is a constant. If the pulses were nearly
resolved, i.e., if

$$|S_0(t)| \ll S_0(0) \quad \text{for } |t| \ge T/2, \tag{2}$$

$S(t)$ would look like a pure AM signal as illustrated in Fig. 4(b).

Fig. 1 — Block diagram of a complete repeater. [Block labels: input waveguide; channel-dropping filters; channel-adding filters; up converter; to repeater.]

The FM-DCPSK class is one in which the amplitude of the signal 
remains constant and the phase change (if any) between adjacent 
sampling times is effected by changing the carrier frequency. As an 
example, such a signal might have the form 


$$S(t) = \exp\left\{i\left[\omega_0 t + \int_0^{t}\omega(t')\,dt'\right]\right\}, \quad \text{where } \int_{(n-1)T}^{nT}\omega(t')\,dt' = \alpha_n, \tag{3}$$

and $\alpha_n$ is as defined for the AM-DCPSK system. An example of such a
signal is shown in Fig. 4(d).

Fig. 2 — Block diagram of the AM-DCPSK model repeater. [Block labels: differential phase detector; counter.]

Fig. 3 — Block diagram of the FM-DCPSK model repeater.

The signals used in the repeater to be described in Section III were 
essentially the AM-DCPSK class, although, in practice, some fre- 
quency modulation of the tails of the pulses is inevitable. Signals used 
in the repeater described in Section IV were essentially of the FM- 
DCPSK class, although some amplitude modulation is inevitable due 
to the finite bandwidth of the devices used in the experiment. 

The AM-DCPSK signal has the advantage that a comparatively 
complete theoretical analysis of error rate in the presence of intersymbol
interference is available. The FM-DCPSK signal has two advantages,
namely, that a phase-locked oscillator can be used to provide
limiting and gain, and that it is comparatively easy to recover
timing directly from the signal. 

Certain features of the model repeaters shown in Figs. 2 and 3 are 
identical. Others are quite different. The components which are com- 
mon to both models are described below. The others are described in 
Sections III and IV.

All of the traveling-wave tubes used are BTL experimental Model 
M1917. 
2.1 Random Word Generator

The random word generator circuitry (for both experimental model 
repeaters) is identical to the baseband regenerator used with the 
AM-DCPSK model (to be described in Section III). Wideband ran- 
dom noise from three cascaded amplifiers is used for a random input 
signal to this “regenerator.” The output is, therefore, a sequence of 
random positive and negative pulses occurring at the 160-megabit rate. 


2.2 Differential Phase Detector 

The differential phase detector (DφD) is shown in Fig. 5. This device
is used in both models as the DφD and, in addition, an adaptation of
the device is used in the comparator for the AM-DCPSK model and 
another adaptation is used in the timing recovery circuit of the self- 
timed version of the FM-DCPSK model. The behavior of this device 
is discussed in the appendix of Ref. 5 for an FM-DCPSK signal. The 
behavior of the device with one time slot delay for an AM-DCPSK 
signal is quite straightforward and will be discussed briefly in Section 
III. 


III. THE AM-DCPSK REPEATER


3.1 General Description of the AM-DCPSK Repeater 

A block diagram of the AM-DCPSK Repeater is shown in Fig. 2. 
A random binary signal (at baseband) is provided by the random word 
generator at a bit rate of 160 Mb/s. This random “message” is then 
transferred onto the 11.2-GHz carrier by the first pulse modulator,
the two states now being the two phases 0 or $\pi$. The pulse signal is
then amplified by TWT 1. Part of the signal (hereafter called the
reference signal) is then tapped off in the directional coupler and
stored in the delay line for later comparison with the regenerated
signal. The remainder of the signal passes through filter $F_1$. Noise is
added to the signal and the combined signal and noise are filtered by
$F_2$ and amplified by TWT 2. The differential detector then processes
this corrupted signal and the regenerator makes the decision as to
whether a change or a same has occurred, regenerates a positive or
negative pulse accordingly, translates back into differential binary,
and regenerates the translated signal into a form suitable for driving
the second pulse modulator. This second pulse modulator (which derives
its CW from the same source, a klystron, as the first pulse modulator)
then provides a signal which carries the same message (except
where errors are made) as the output of the first pulse modulator.
This signal is amplified and combined with the reference signal in the
comparator. The comparator then signals the counter when an error
has been made.

Fig. 5 — Differential phase detector (DφD). [Block labels: Riblet coupler No. 1; Riblet coupler No. 2; phase shifter φ; one-time-slot delay; output.]


3.1.1 Differential Phase Detector for AM-DCPSK Pulses 


The AM-DCPSK signal is of the form given in (1). Straightforward
analysis of Fig. 5 (assuming ideal square-law detectors) with the
delay path equal to one bit interval shows the output of the DφD
to be

$$\tfrac{1}{2} S_0^2(t)\cos(\alpha_n - \alpha_{n-1} - \varphi)$$

at the middle (the sampling time) of the $n$th time slot. For $\varphi = 0$
this signal is $+\tfrac{1}{2}S_0^2(t)$ when $\alpha_n = \alpha_{n-1}$ and is $-\tfrac{1}{2}S_0^2(t)$ when
$\alpha_n = \alpha_{n-1} + \pi$. For $\varphi = \pi$ the opposite result obtains.
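
A short numerical sketch of this detector follows. It is an idealization introduced here, not the circuit analysis itself: the hybrid is assumed to form the half-sum and half-difference of the direct pulse and the delayed, phase-shifted pulse; each square-law detector is assumed to report the mean-square value of a real sinusoid; and the function and argument names are hypothetical.

```python
import cmath
import math

def dphid_output(alpha_prev, alpha_now, phi=0.0, s0=1.0):
    """Sampled detector output for two successive pulse phases (radians).

    alpha_prev, alpha_now: phases of the previous and present pulses.
    phi: setting of the phase shifter in the delay path.
    s0: pulse amplitude at the sampling time.
    """
    direct = s0 * cmath.exp(1j * alpha_now)
    delayed = s0 * cmath.exp(1j * (alpha_prev + phi))
    sum_port = (direct + delayed) / 2.0
    diff_port = (direct - delayed) / 2.0
    # Mean-square value of a real sinusoid with complex amplitude A is |A|^2 / 2.
    return 0.5 * abs(sum_port) ** 2 - 0.5 * abs(diff_port) ** 2

if __name__ == "__main__":
    for label, prev, now in (("same", 0.0, 0.0), ("change", 0.0, math.pi)):
        print(label, round(dphid_output(prev, now), 6))
```

With these assumptions the output reduces to one half of the squared amplitude times the cosine of the phase difference less the phase-shifter setting, so a "same" and a "change" give outputs of equal size and opposite sign.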

In the actual experiment, the signals are corrupted by both inter- 
symbol interference and noise. The regenerator must make its decision 
on the basis of this corrupted signal and respond accordingly. 




3.1.2 Pulse Modulator 


The pulse modulators convert an 11.2-GHz CW signal into pulses 
having one of two possible phase states in accordance with the base- 
band signal pulse. Positive drive pulses produce an output of one phase 
and negative drive pulses produce an output shifted by 180 degrees. 

The modulator consists of a Western Electric 2B hybrid junction 
with a matched pair of 1N78 diodes shunting the conjugate arms. One 
arm contains a phase shifter in addition to the diodes. The device is
shown schematically in Fig. 6. 

The operation of the modulator depends upon the return loss of
the diodes and the balance between them. They are dc biased to be
matched so that very little power is reflected from them. Application 
of a baseband pulse to the diodes then produces reflected RF pulses 
in the conjugate arms. With the proper setting of the phase shifter 
they will add in the output circuit yielding an output pulse. When 
the drive pulse is of the opposite polarity, the reflected signals in the 
conjugate arms both differ by 180 degrees from the previous case. 
The resulting output pulse is then also shifted by 180 degrees. 

In practice the phase shift from one state to the other is found to be
180 ± 5 degrees. The ±5 degrees represents the limit of accuracy of
the measurement. The modulator loss (the ratio of CW power in to
pulse power out) increases as the CW input drive is increased or as 
the baseband drive is decreased. Under the low-drive conditions used 
in this experiment this loss is of the order of 20 dB. 


3.1.3 Baseband Regenerator 


The baseband regenerator (Fig. 7) samples the polarity of the 
differential detector output at the center of the time slots and trans- 
lates this information into an output train of equal amplitude positive 
and negative pulses which are converted into 0 or $\pi$ RF phases in the
pulse modulator. In the absence of errors, this pulse train has a 
one-to-one correspondence with the original signal. 

The regenerator consists of three direct-coupled stages of tunnel-diode
pairs. The first and third stages are a form of Goto-pair majority-logic
circuit triggered at the 160-Mb/s rate by 1-nsec timing pulses.
The middle stage is an adaptation of a well-known “flip-flop” circuit.

Fig. 6 — Pulse modulator schematic diagram. [Labels: 11.2-GHz CW in; drive pulses; 11.2-GHz detectors; RF pulses out.]

Fig. 7(b) — Baseband regenerator schematic.

The input signal for the first stage is the output of the differential 
phase detector. (Some delay is provided in the coupling to minimize 
interference between the input signal and the pulses generated by the 
first stage.) The timing of the 160-Mb/s triggering pulses is adjusted 
so that they coincide with the centers of the time slots of the input 
signal. An output pulse is triggered in every time slot; its polarity is 
dependent upon the majority polarity of the input signal at the 
sampling time. 

The second stage, the state-switcher, consists of two tunnel-diodes 
connected in series. Each time a positive pulse arrives the diodes ex- 
change states. The diodes remain in their respective states so long as 
no positive pulse appears. The circuit time constant allows changes to 
be made as fast as the maximum requirement of the system—up to 
once every 6.25 nsec. 

When the diodes exchange states the voltage at the junction of the 
two diodes changes between two discrete values relative to ground. 
This point is coupled to the junction of the third stage diode pair, and 
the timing of the third stage triggering pulse is adjusted so as to have 
decision times occur midway between any possible state changes. 

The third stage, operating exactly as the first stage, then puts out 
pulses which are positive when the state switcher is in one state and 
negative when it is in the other state. These output pulses provide the 
drive for the RF pulse modulator described earlier. 
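
The logic of the three stages can be summarized in a few lines. This is a behavioral sketch only, with no attempt to model the tunnel-diode circuits; the function name, the use of +1 and -1 for pulse polarity, and the choice of initial state are assumptions made for illustration.

```python
def regenerate(detector_samples, initial_state=0):
    """Map sampled detector polarities to drive-pulse polarities.

    detector_samples: detector output at the center of each time slot.
    Returns a list of +1 / -1 drive pulses, one per time slot.
    """
    state = initial_state
    drive = []
    for sample in detector_samples:
        # Stage 1: polarity decision at the sampling time.
        first_stage = 1 if sample > 0 else -1
        # Stage 2: the state switcher exchanges states on a positive pulse only.
        if first_stage > 0:
            state ^= 1
        # Stage 3: output pulse polarity follows the state of the switcher.
        drive.append(1 if state else -1)
    return drive

if __name__ == "__main__":
    print(regenerate([0.4, -0.3, 0.5, 0.2]))
```

Because the second stage holds its state between positive pulses, the drive polarity carries the running sum of the detected changes, which is the differential encoding the pulse modulator requires.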


3.1.4 Comparator 


The function of the comparator is to compare the regenerated 
version of the corrupted signal with the uncorrupted reference signal 
and to indicate to the counter when an error has been made by the 
regenerator. Information in a DCPSK signal is carried not by the 
phase of an individual pulse but by the relative phase between adja- 
cent pulses. The four possible situations are listed in Table I. 

TABLE I — POSSIBLE RELATIVE INFORMATION COMBINATIONS
IN THE COMPARATOR

Relative phase between two      Relative phase between two corresponding
reference pulses                regenerated pulses

Change (π)                      Same (0)
Same (0)                        Change (π)
Same (0)                        Same (0)
Change (π)                      Change (π)

In the first two situations in Table I, an error was made; in the last
two, no error was made.

As an example, the signal with phases

    0  0  π  0  π  π  0

is equally well represented, after regeneration, by itself or by its exact
opposite, viz.,

    π  π  0  π  0  0  π.


We can refer to the case where the regenerator is reproducing the input 
exactly as the “G mode” and the case where it is reproducing the oppo- 
site of the input as the “U mode.” (Such a distinction has meaning 
only where some absolute phase reference exists for the original and 
regenerated signals. This is the case in our experiment since both 
modulators are supplied from the same RF source; this is probably not 
the case in an actual system where the regenerated signal would not 
necessarily be transmitted at the same carrier frequency as the in- 
coming signal.) 

A little thought will convince the reader that the effect of an error 
by the regenerator is to cause the regenerated signal to change modes 
(U to G or G to U). In our example of the preceding paragraph, sup- 
pose that the change between the third and fourth symbols is (errone- 
ously) interpreted by the regenerator (due to noise) to be a same. If 
the regenerator was originally functioning in the G mode, the result 
would be as shown 


    Input:   0  0  π  0  π  π  0

    Output:  0  0  π  π  0  0  π
             G mode  |  U mode
                   error

Note that the regenerator responds to sames and changes, not to
absolute phases.

Therefore, the comparator must be designed to detect transitions
between modes of operation and to respond to these transitions.
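
The mode change can be reproduced with a small sketch. The representation is an assumption made for illustration: phases 0 and π are coded as the integers 0 and 1, and the helper names are hypothetical.

```python
def changes(phases):
    """True wherever the phase differs between adjacent pulses."""
    return [a != b for a, b in zip(phases, phases[1:])]

def reencode(first_phase, change_flags):
    """Rebuild a phase sequence from its first phase and its change flags."""
    out = [first_phase]
    for flag in change_flags:
        out.append(out[-1] ^ 1 if flag else out[-1])
    return out

def modes(reference, regenerated):
    """Label each time slot G (phases agree) or U (phases opposite)."""
    return ["G" if r == s else "U" for r, s in zip(reference, regenerated)]

if __name__ == "__main__":
    reference = [0, 0, 1, 0, 1, 1, 0]      # 0 0 pi 0 pi pi 0
    decisions = changes(reference)
    decisions[2] = False                   # change between 3rd and 4th read as a same
    regenerated = reencode(reference[0], decisions)
    print(regenerated)
    print(modes(reference, regenerated))
```

A single wrong decision inverts every later pulse, so the regenerated signal stays in the other mode until the next error; the information carried in the sames and changes is wrong in only one time slot.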
The comparator is identical in design and construction to the differential
phase detector except that both inputs to the first Riblet
coupler are used. It is shown in Fig. 8. Let $S_n(t)$ and $R_n(t)$ represent
the regenerated and reference signals in the $n$th time slot, respectively.
Then

$$S_n(t) = S_0(t)\exp[j(\omega t + \alpha_n)]$$

and

$$R_n(t) = S_0(t)\exp[j(\omega t + \beta_n + \theta)],$$

where $\alpha_n$ and $\beta_n$ are the phases of the $n$th pulse in the regenerated and
reference signals, respectively, and $\theta$ represents the phase shift in the
long delay line (which is introduced into the reference signal path in
order that corresponding time slots of the reference and regenerated
signals be compared) and the phase shifter in the reference signal
path. A straightforward analysis of Fig. 8 shows that the output of
the comparator in the $k$th time slot is

$$V_k = \tfrac{1}{2}[\cos(\beta_k - \beta_{k-1} - \varphi) - \cos(\alpha_k - \alpha_{k-1} - \varphi) + \sin(\beta_k - \alpha_{k-1} + \theta - \varphi) + \sin(\alpha_k - \beta_{k-1} - \theta - \varphi)].$$

From this equation one can see the following results for the cases
of interest. First, $V(\varphi = 0, \theta) = -V(\varphi = \pi, \theta)$ for all values of $\theta$.

Consider the case $\theta = 0$, $\varphi = 0$. There is a pulse of amplitude ±1
whenever an error is made and no pulse when there is no error. The
pulse is positive if the regenerated signal changes phase and the refer- 
ence signal does not. It is negative if the reference signal changes phase 
and the regenerated signal does not. 


Fig. 8 — Comparator for AM-DCPSK repeater. [Labels: inputs S(t) and R; phase shifters; one-time-slot delay; detector diodes; output.]

Consider the case $\theta = \pi/2$, $\varphi = 0$. There is a pulse of amplitude ±2
whenever the regenerated signal changes from the G to the U mode
but no pulse when the signal changes from U to G. ($\theta = -\pi/2$ gives
a pulse on a U → G transition but not G → U.) The sign of the pulses
is as described in the $\theta = 0$ case. This type of operation gives exactly a
scale-of-two reduction in the counting rate and an increase of 6 dB 
in pulse height of the error signal. Since both of these effects are ben- 
eficial, this type of operation was generally used. 
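
These cases can be checked by direct evaluation. The sketch below takes the comparator output in the $k$th time slot to be one half of cos(β_k − β_{k−1} − φ) − cos(α_k − α_{k−1} − φ) + sin(β_k − α_{k−1} + θ − φ) + sin(α_k − β_{k−1} − θ − φ), with α the regenerated phases and β the reference phases; that reading of the expression, the unit pulse amplitude, and the function name are assumptions of this example.

```python
import math

def comparator_output(beta_prev, beta_now, alpha_prev, alpha_now, theta, phi=0.0):
    """Comparator output for one time slot, for unit-amplitude pulses.

    beta_*: reference phases; alpha_*: regenerated phases (radians).
    theta: phase shift in the reference path; phi: comparator phase shifter.
    """
    return 0.5 * (
        math.cos(beta_now - beta_prev - phi)
        - math.cos(alpha_now - alpha_prev - phi)
        + math.sin(beta_now - alpha_prev + theta - phi)
        + math.sin(alpha_now - beta_prev - theta - phi)
    )

if __name__ == "__main__":
    pi = math.pi
    # The regenerated signal changes phase where the reference does not.
    g_to_u = dict(beta_prev=0.0, beta_now=0.0, alpha_prev=0.0, alpha_now=pi)
    u_to_g = dict(beta_prev=0.0, beta_now=0.0, alpha_prev=pi, alpha_now=0.0)
    for theta in (0.0, pi / 2, -pi / 2):
        print(f"theta = {theta:+.3f}:",
              f"G to U {comparator_output(theta=theta, **g_to_u):+.1f},",
              f"U to G {comparator_output(theta=theta, **u_to_g):+.1f}")
```

Evaluated this way, the θ = 0 setting responds to both kinds of transition with unit amplitude, whereas θ = ±π/2 responds to only one of them with twice the amplitude, which is the scale-of-two behavior described above.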


3.1.5 Error Counting Circuit


The detected output of the comparator contains cancelled pulses
except where an error has occurred. Errors result in additions of
the compared pulses which appear as either positive or negative output
pulses. Since only pulses of one polarity can be counted at a time,
the positive pulses are eliminated and only the negative pulses counted.
Two settings of the comparator phase shifter, $\varphi$, 180 degrees apart, are
therefore required to make a complete count of the errors.

The output signal is fed into a tunnel-diode unipolar amplitude 
discriminator (see Fig. 9). This is another Goto-pair of the type used
in the regenerator but it is center-biased negatively to prevent positive 
input pulses from switching a diode. No timing is used. When a nega- 
tive pulse of sufficient amplitude is received, the pre-biased diode 
switches to its second state and remains there for a maximum of 2 
nsec, at which time the reflected pulse from a short-circuited delay line 
quickly returns it to its first state and it is ready to receive another 
pulse well before the next time slot. 

The input level is adjusted so that only error pulses are passed. This 
is to prevent counting as errors the half-amplitude pulses which result 
from an occasional absence of pulses in time slots of the input signal.
These are not properly to be considered as errors but only as trivial 
deductions from the 160-Mb/s total. 

The output from the tunnel-diode amplitude discriminator is fed into 
a pulse amplifier followed by another amplitude discriminator and 
thence into a Hewlett-Packard 524-C Counter. 
Fig. 9 — Error-counting circuit.

3.2 Procedure


3.2.1 Adjustment of the Random Signal 


The statistics of the signal can be adjusted by changing the level 
and/or spectrum of the noise which drives the random word generator 
as well as by adjusting the bias on the various stages of the random 
word generator. Necessary conditions for a random signal, which were
verified by monitoring both the envelope and the differentially detected
“random” signals on the sampling scope, are that

(i) the number of pulses in the two phase states be equal,
(ii) the number of changes equal the number of sames, and
(iii) the four possible transitions (same → same, change → change,
same → change, change → same) between time slots be equally populated.


The signal was made to satisfy these three requirements. They consti- 
tute our only check on randomness. 
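
The same three checks are easy to state in software. The sketch below applies them to a recorded list of phase states; it illustrates the checks only and does not represent the sampling-scope procedure used in the experiment. The coding of the two phases as 0 and 1, the function name, and the pseudo-random test sequence are assumptions.

```python
import random
from collections import Counter

def randomness_report(phases):
    """Return the quantities examined by the three checks.

    phases: list of 0/1 phase states, one per time slot.
    Returns (fraction of pulses in state 1, fraction of changes,
    dict of fractions for the four same/change transitions).
    """
    frac_state_one = sum(phases) / len(phases)
    flags = [a != b for a, b in zip(phases, phases[1:])]  # True means "change"
    frac_changes = sum(flags) / len(flags)
    pairs = Counter(zip(flags, flags[1:]))
    total = len(flags) - 1
    transitions = {
        pair: pairs[pair] / total
        for pair in ((False, False), (False, True), (True, False), (True, True))
    }
    return frac_state_one, frac_changes, transitions

if __name__ == "__main__":
    random.seed(0)
    sequence = [random.randint(0, 1) for _ in range(50000)]
    state_one, change_frac, transitions = randomness_report(sequence)
    print(f"state fraction {state_one:.3f}, change fraction {change_frac:.3f}")
    for pair, frac in transitions.items():
        print(pair, f"{frac:.3f}")
```

For a random signal the first two fractions should be close to one half and each of the four transition fractions close to one quarter. As noted above, these are necessary conditions only.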


3.2.2 Calibration of Noise and Signal Power 


The average noise power was read directly on a Hewlett-Packard 
Model 481B power meter. This calibration was checked frequently 
during the course of the experiment and was held to ±0.1 dB.

The noise came primarily from a noise source consisting of two 
model M1917 travelling-wave tubes in tandem followed by a 280 MHz 
filter. (An additional −26.5 dBm of noise was introduced by TWT 1.)

The peak signal power was determined by detecting the envelope of 
the signal with a calibrated detector and setting the level for a peak 
power of +12.0 dBm. This measurement depends on the calibration of 
the detector diode, ability to read the pulse height as displayed on the 
sampling scope accurately, and the choice of which pulse height to use, 
since intersymbol interference caused several pulse heights to be 
present. The experimental error here is estimated to be ±0.4 dB.

Once the peak signal power, $S$, is set at +12.0 dBm, the average
signal power corresponding to $S$, $\langle S\rangle$, can be read on the power meter.
The value of $\langle S\rangle$ can be reproduced with an accuracy of ±0.1 dB and
other values of pulse power can be set by changing $\langle S\rangle$ by the appropriate
amount. Thus even though the absolute value of $S$ is in question by
±0.4 dB, the error in the relative power levels for the same pulse shape
is only ±0.2 dB.
3.2.3 Determination of Threshold


Consider a situation where the peak signal power, $S$, going into the
differential phase detector is sufficient for proper operation of the
regenerator and the noise power is negligible. Now if $S$ is slowly
decreased, a value $S_1$ will be reached where some errors occur. As $S$ is
further decreased, a second value $S_2$ (about 2.5 dB below $S_1$) is reached
where the regenerator begins to completely disregard the input signal.
We define the threshold, $T$, as

$$T = \tfrac{1}{2}(S_1 + S_2).$$

One finds experimentally that $T \approx 0$ dBm.


3.2.4 Determination of Intersymbol Interference 


In the presence of intersymbol interference, the envelope-detected 
pulse will have several values at the decision point due to the four 
possible states of its nearest neighbors. The ratios of these pulse 
heights (in dB) can be measured directly on a sampling scope. From 
this the value of $\rho_T$, where

$$\rho_T = 20\log\frac{S_0(0)}{S_0(T)}$$

with $S_0$ as defined in (1), can be calculated.


3.2.5 Summary of Experimental Errors 


The experimental errors are summarized in Table II.


TABLE II — SUMMARY OF EXPERIMENTAL ERRORS

Quantity                              Experimental error (dB)

Peak signal power, absolute value     ±0.4
Peak signal power, relative value     ±0.2
Average noise power                   ±0.1
Threshold                             ±1.0
S/N (absolute)                        ±0.6
S/N (relative)                        ±0.3
S/T (absolute)                        ±1.2
S/T (relative)                        ±0.2
ρ_T                                   ±1.5


3.2.6 Timing and Center Bias Adjustments


The timing for the signal pulses and the regenerator is derived from
a single 160-MHz clock. The decision time was set by adjusting the
timing of the regenerator pulse for minimum error count with a very
small input signal.

The center bias setting of the first regenerator strongly influences
the ratio of errors made in detecting changes, $N_c$, to errors made in
detecting sames, $N_s$. In the presence of time crosstalk, the optimum
setting of the center bias is not zero but rather in the direction to
favor changes. In the experiments performed with resolved pulses
(no intersymbol interference) the center bias setting was determined
by balancing $N_c$ and $N_s$ at a value of S/N such that $N_c + N_s \approx 80$
counts per second. In the experiments performed with unresolved
pulses (significant intersymbol interference), the center bias setting was 
determined by finding the two values of center bias where, in the absence 
of signal and noise, the regenerator jumps from one state to the other 
and taking the midpoint between these values. This setting does not 
optimize (exactly) the error rate, but it does allow a more direct com- 
parison with theory. 


3.3 Results and Comparison With Theory 


3.3.1 Narrow-Pulse Experiment 


The first experiment to be discussed is the measurement of error
rate as a function of S/N for several values of S/T. For this experiment,
the filter $F_1$ was removed entirely and filter $F_2$ was placed between
the second noise tube TWT-N2 and the noise attenuator. The
pulses were then completely resolved as seen in Fig. 10(a). The eye
diagram (output of the differential phase detector) is shown in Fig.
10(b). The experimental results are plotted in Fig. 11 along with the
corresponding theoretical curves.⁶

The theory for this comparison assumes that all pulses are of the 
same peak power S, whereas in fact the modulator produces pulses of 
two slightly different amplitudes. This is an important effect for small 
values of S/T and probably accounts for part of the discrepancy in 
the data for S/T = 4.5 dB and a great deal of the discrepancy for 
S/T = 3.0 dB. Also note that small errors in determination of threshold
are extremely important when S/T ≈ 3.0 dB.


3.3.2 Unresolved-Pulse Experiment 

Filters $F_1$ and $F_2$ (as shown in Fig. 2) were adjusted to have 6-dB
bandwidths of 250 MHz and 262 MHz, respectively. The overall 6-dB
bandwidth of the two filters in series as used in the experiment was
then 173 MHz. This widened the pulse considerably. The pulse envelope
and the eye diagram for this signal are shown in Fig. 12. The intersymbol
interference, $\rho_T$, was measured to be 22.5 ± 1.5 dB per tail.
The experimental results are plotted in Fig. 13 along with the theoretical
curves.

Fig. 10 — Narrow-pulse experiment waveforms. (a) Envelope-detected RF
pulse. (b) Eye diagram (output of differential phase detector, input to
regenerator).

Fig. 11 — Error rate vs S/N for narrow pulses (negligible intersymbol
interference). Points represent experimental values; curves are theoretical
results from Ref. 6.


3.3.3 Factor-of-Two Experiment 

Various arguments have been presented in the past which (errone- 
ously) conclude that (at least under conditions of low error rate) an 
error in decision results in two errors (in adjacent time slots) in the 
regenerated signal. Salz and Saltzberg⁷ have shown that these arguments
are fallacious and that a single error in decision results, with
about 90 percent probability, in a single error in the regenerated signal 
at the error rates used in these experiments. 

Since these double errors, if they existed, would occur in adjacent
time slots, our counter (video bandwidth is 10 MHz) would not be able
to resolve these pulses without benefit of the scale-of-two feature of
the comparator. As explained in Paragraph 3.1.4, this scale-of-two can
be removed by changing the phase of the reference signal 90 degrees.
When this is done, the total count should double if the number of
double errors is negligible. On the other hand, if double errors were
created for every decision error, the total count would increase only
50 percent since (for random signals) half of the “secondary” error
pulses from the comparator would be of the same polarity as their
associated “primary” error pulses and hence could not be resolved in
the counter.

Within experimental errors of a few percent, the total count is
doubled when the scale-of-two is removed, indicating that the errors
in the regenerated signal do not occur in pairs.

Fig. 12 — Widened pulse experiment waveforms. (a) Envelope-detected RF
pulse. (b) Eye diagram (output of differential phase detector, input to
regenerator).

Fig. 13 — Error rate vs S/N for band-limited pulses. Points represent
experimental values; curves are theoretical values from Ref. 5 evaluated
at pr = 22.5 dB (the experimentally determined value for this waveform).


3.3.4 Test of Effect of Phase Error in the Differential Phase Detector
on Error Rate


The degradation in S/N produced by a small error, $\delta\varphi$, in the phase
shift (or equivalently in the length) of the delay loop of the
differential phase detector has been calculated.’ This effect is of interest
because $\delta\varphi$ is a function of temperature; hence, it cannot be made
and kept arbitrarily small in a practical repeater. Fig. 14 shows the
experimental points and the theoretical curve of equivalent degradation
in S/N as a function of $|\delta\varphi|$. The points designated ○ were taken with
$\delta\varphi < 0$, those designated □ with $\delta\varphi > 0$.


IV. THE FM-DCPSK REPEATER 


4.1 General Description of the FM-DCPSK Repeater 

Fig. 3 is a block diagram of the FM-DCPSK repeater. The overall 
operation of this repeater is quite similar to that of the AM-DCPSK 
repeater described in the preceding section. The random word generator, 
error counting circuit, and RF noise source are identical to those de- 
scribed in Paragraphs 2.1, 3.1.5, and 2.1, respectively. 


4.1.1 FM Deviator 


The pulse modulator used in the AM repeater is replaced with an FM
deviator. This FM deviator consists of a frequency-modulated Esaki
diode oscillator. Modulation is produced by pulsing the bias voltage
either positively or negatively in every time slot. The binary
information is contained in the choice of polarity. This voltage pulse causes
the oscillator frequency $f$ to deviate from its nominal value $f_0$. This is
illustrated in Fig. 15(a) and (b).

Fig. 14 — Degradation in effective S/N as a function of phase shift error in
the differential phase detector. (Ordinate: dB degradation; abscissa:
degrees.)

Fig. 15 — FM-DCPSK signal: (a) modulating voltage vs time; (b) frequency
vs time; (c) phase vs time; (d) differential phase vs time.

This modulation causes a phase change $\delta\varphi_n$ given by


$$\delta\varphi_n = 2\pi \int \left[ f(t) - f_0 \right] dt .$$


During the $n$th time slot, sampling is performed after the phase shift
has been completed (i.e., at the center of the intervals on which $f(t)$
equals $f_0$ in Fig. 15(b)); thus, $\delta\varphi_n$ represents the phase change between
the $n$th and the $(n+1)$th samples.

Optimum noise immunity is obtained when the two possible signal
states are anticorrelated, that is, when the two possible values of $\delta\varphi_n$
differ by $\pi$. This is achieved by adjusting the pulse amplitude until
$\delta\varphi_n$ equals $\pm\pi/2$. In the following it will be assumed that this condition
is satisfied.

The phase (relative to the phase of the carrier frequency) $\varphi(t)$ for
the modulation illustrated in Fig. 15(a) and (b) is shown in Fig. 15(c). The
differential phase detector used for this type of modulation is identical
to the one described in Paragraph 3.1.1. The phase shifter is adjusted
to add (or subtract) a constant $\pi/2$ shift in the delayed signal
so that it compares $\varphi(t)$ with $\varphi(t - T) + \pi/2$. Since $\varphi(t)$ and
$\varphi(t - T)$ differ by $\pm\pi/2$, $\varphi(t)$ and $\varphi(t - T) + \pi/2$ differ by 0 or $\pi$,
and the decision-making circuitry operates just as it did for the system
described in Section III. The quantity $\varphi(t) - [\varphi(t - T) + \pi/2]$ is
illustrated in Fig. 15(d).
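
The differential detection just described can be illustrated with a short numerical sketch. It is not taken from the experiment: the mapping of a binary 1 to a $+\pi/2$ phase step and of a 0 to a $-\pi/2$ step, and the function names `deviator_phase` and `differential_detect`, are assumptions made only for illustration. The sketch accumulates the phase steps as an idealized deviator would, then compares each sample with the previous sample advanced by $\pi/2$.

```python
import math

def deviator_phase(bits):
    """Carrier phase at the sampling instants of an idealized FM deviator.

    Assumption: a 1 advances the phase by +pi/2 in its time slot and a 0
    retards it by pi/2 (the polarity convention is hypothetical).
    """
    phase = [0.0]
    for b in bits:
        step = math.pi / 2 if b else -math.pi / 2
        phase.append(phase[-1] + step)
    return phase

def differential_detect(phase):
    """Compare phi(t) with phi(t - T) + pi/2; the difference is 0 or -pi."""
    decisions = []
    for previous, current in zip(phase, phase[1:]):
        difference = current - (previous + math.pi / 2)
        # cos(0) = +1 and cos(-pi) = -1, so the sign separates the two states
        decisions.append(1 if math.cos(difference) > 0 else 0)
    return decisions

bits = [1, 0, 0, 1, 1, 1, 0, 1]
recovered = differential_detect(deviator_phase(bits))
print(recovered)
```

Because the deviator accumulates phase from slot to slot, the message appears in differential form on the carrier without a separate encoding step, and the detector returns it in straight binary.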

The requirements on the FM deviator differ from the usual require- 
ments on an FM modulator in that only the area under the frequency- 
versus-time curve is important. There is no requirement that frequency 
be a linear or even a continuous function of the voltage. 

Each of the FM deviators built for this experiment consists of a 
0.5-mA GaAs Esaki diode mounted directly across 50-mil high X-band 
waveguide with one terminal grounded and the other brought out 
through an 11.2-GHz coaxial trap® as shown in Fig. 16. They have an 
output power of the order of —17 dBm. They are quite stable and have 
suffered no noticeable degradation over periods of up to 6 months. 

The effect of the modulation on the amplitude of the output of the
FM deviator turns out to be very small if the oscillator is properly
tuned. Even after filtering the output to the bit-rate bandwidth, the
amplitude variation is only about ±1 dB. Since the $\pm\pi/2$ modulation
gives complete symmetry about the carrier frequency $f_0$, the optimum
value of the free-running frequency of the PLO driven by this signal
is also $f_0$.


4.1.2 Baseband Regenerator 


The baseband regenerator used in this repeater is just the first stage 
or “first regenerator” of the device used in the AM repeater. Instead of 
driving a state switcher this “first regenerator” drives the FM deviator 
directly. The function of the state switcher, namely, to translate from 
straight binary to differential binary is accomplished automatically by 
the manner in which the FM deviator functions. 

Fig. 16 — FM deviator.


4.1.3 Timing Recovery 


One of the problems associated with any digital regenerative repeater 
is that of providing a clock to allow proper sampling of the received 
signal. The type of signal used in this experiment is particularly suit- 
able for timing recovery because there is a frequency change in each 
time slot regardless of the message statistics. In this repeater, the 
timing was recovered by taking a portion of the incoming signal and 
putting it into a device which is like the differential phase detector 
except that the delay line is approximately 0.6 bit interval long. This 
device is described in the appendix. 


4.1.4 Limiter

An Esaki diode limiter was incorporated into the circuit in order to 
take advantage of the improvements predicted in Ref. 6. This device 
consisted of an Esaki diode oscillator built directly in 50-mil high 
X-band waveguide. Input and output were accomplished by means of a
Riblet coupler in the experimental repeater (only because a circulator 
was not at hand). In addition to limiting, the device gave about 20 dB 
of gain. 

The repeater was operated both with and without the limiter; re- 
sults of both types of operation are given in Paragraph 4.3. 


4.1.5 Comparator 


The comparison of input and output signals is made at baseband in 
this experiment (rather than at X-band as in the AM-DCPSK experi- 
ment). The output of the random word generator is split with a 6-dB 
matched splitter. One branch goes to the FM deviator, the other 
through a delay line to a baseband hybrid where it is compared with 
the output of the baseband regenerator. The polarities are such that 
if no error has been made the pulses cancel in the output arm of the 
hybrid whereas if an error has been made, the pulses add. This signal 
is then applied to the error counting circuit described in Paragraph 
3.1.5. 

Note that the baseband signal contains the information directly, not 
in differential form as does the X-band signal. Thus, some of the com- 
plicated logic of the X-band comparator is avoided. 


4.2 Procedure 


The procedure followed was similar to that described in Paragraph
3.2. The two significant differences are (i) the timing was derived from
the timing recovery circuit, not from the 160-Mb/s clock which ran
the random word generator, and (ii) the signal-to-noise ratio, when the
limiter was used, was measured at the input to the limiter.


4.3 Results and Comparison with Theory 


The results of experiments to measure error-rate versus S/N for the 
FM-DCPSK system are given in Fig. 17 for the case where a limiter 
was not used and in Fig. 18 for the case where a limiter was used. 
These results indicate that for an error rate of 10-° the system operates 
with about 1.5-dB degradation from theoretical ideal without the
limiter and with about 0.5 to 0.8-dB degradation with the limiter. This
degradation is due in part to the finite-width decision threshold
discussed in Ref. 6 and in part to the intersymbol interference discussed
in Ref. 5. The signal-to-threshold ratio is of the order of 12 dB
for these experiments which, ignoring intersymbol interference, should
account for about 0.4 dB of the degradation for the unlimited case.
For the limited case the degradation due to a 12-dB signal-to-threshold
ratio is about 0.1 dB. It is impossible to consider the intersymbol
interference quantitatively because the exact form of the signal is
unknown. Nevertheless, it is not unreasonable to assume that it
accounts for a large part of the remaining discrepancy between the
experimental and theoretical curves in Figs. 17 and 18.

Fig. 17 — Error rate vs S/N for the unlimited FM-DCPSK signal.

Fig. 18 — Error rate vs S/N for the limited FM-DCPSK signal.
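
The size of the "remaining discrepancy" can be made explicit with a small piece of arithmetic. The sketch below assumes that the separate contributions to the degradation simply add in dB, which the text does not state; the function name `remaining_db` is hypothetical. The input figures are the ones quoted above.

```python
def remaining_db(measured_db, threshold_db):
    """Degradation left after removing the finite-threshold contribution.

    Assumption: the contributions add in dB.
    """
    return round(measured_db - threshold_db, 2)

# Unlimited case: about 1.5 dB measured, about 0.4 dB from the threshold.
unlimited = remaining_db(1.5, 0.4)

# Limited case: 0.5 to 0.8 dB measured, about 0.1 dB from the threshold.
limited = (remaining_db(0.5, 0.1), remaining_db(0.8, 0.1))

print(unlimited, limited)
```

Under this assumption roughly 1.1 dB (unlimited) and 0.4 to 0.7 dB (limited) would be left to intersymbol interference and other effects.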


V. CONCLUSIONS 


Both AM-DCPSK and FM-DCPSK type systems are technically
feasible. The FM-DCPSK type seems preferable in that timing
information can be obtained by the method discussed above, whereas when
the AM-DCPSK system is band limited to a bandwidth of the order
of the bit rate the pulses are no longer resolved and timing is quite
difficult to recover. In addition, a phase-locked oscillator serves as a
suitable limiter for FM-DCPSK systems, whereas a different (and
perhaps more complicated) limiter would be necessary for the
AM-DCPSK system. Finally, the baseband circuitry is much simpler for
FM-DCPSK because the translation from straight binary to
differential binary is accomplished automatically by inherent properties
of the FM deviator.


APPENDIX

Timing Recovery from an FM-DCPSK Signal

The basic device described in the appendix of Ref. 5 and shown in
Fig. 5 can be used as an FM discriminator for timing recovery by
proper choice of the delay time $\tau$. For this device one chooses $\tau$ such
that $\omega_0 \tau = m'\pi$. Then (22) of Ref. 5 is, to first order, independent of the
sign of $\omega(t')$ and thus independent of message statistics. One has

$$V(t) = \cos\left\{ \int_{t-\tau}^{t} \omega(t')\, dt' \right\}. \qquad (4)$$


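The analysis cannot be carried further without assuming a particular
form for $\omega(t')$. As an example, consider the signal given by (3) with
$\omega(t')$ given by

$$\omega(t') = a_n \frac{\pi}{T} \cos^2 \frac{\pi t'}{T}$$

where $a_n = \pm 1$ depending on the message and changes sign only at
the points $t' = (n + \tfrac{1}{2})T$. Substituting this into (4) gives

$$V_0(t) = \cos\left\{ \frac{\pi\tau}{2T} + \frac{1}{2} \sin\left(\frac{\pi\tau}{T}\right) \cos\left(\frac{2\pi t}{T} - \frac{\pi\tau}{T}\right) \right\} \qquad (5)$$

$$V_1(t) = \cos\left\{ \pi\left[ \frac{t}{T} - \frac{\tau}{2T} - \left(n + \frac{1}{2}\right) \right] + \frac{1}{2} \cos\left(\frac{\pi\tau}{T}\right) \sin\left(\frac{2\pi t}{T} - \frac{\pi\tau}{T}\right) \right\}, \qquad (6)$$

where (5) applies if $a_n$ does not change sign on the interval $(t - \tau,\, t)$
and (6) applies if it does.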

A study of (5) and (6) reveals that for $\tau \ll T$ the bit-rate-frequency
component of $V_0$ and $V_1$ is quite small. As $\tau$ is increased, the
bit-rate-frequency component of $V_0$ increases up to $\tau = 0.6T$ and then
decreases to zero as $\tau$ is increased to $T$. The bit-rate-frequency
component of $V_1$ increases as $\tau$ increases from zero to $T$. Thus, the
optimum value of delay for the timing recovery discriminator is $\tau = 0.6T$.
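
The location of this optimum can be checked numerically. The sketch below assumes the cosine-squared frequency pulse $\omega(t') = a_n (\pi/T)\cos^2(\pi t'/T)$, for which the output with no sign change in the delay interval is $V_0(t) = \cos\{\pi\tau/2T + \tfrac{1}{2}\sin(\pi\tau/T)\cos(2\pi t/T - \pi\tau/T)\}$. It evaluates the Fourier component of $V_0$ at the bit-rate frequency on a grid of delays and reports the delay at which that component is largest. The function names and grid sizes are arbitrary choices, not part of the original analysis.

```python
import math

def bit_rate_component(tau_over_T, samples=512):
    """Amplitude of the bit-rate-frequency Fourier component of V0(t)."""
    x = math.pi * tau_over_T          # pi * tau / T
    offset = x / 2.0                  # constant term pi * tau / (2T)
    depth = 0.5 * math.sin(x)         # coefficient of the oscillating term
    re = im = 0.0
    for k in range(samples):
        theta = 2.0 * math.pi * k / samples   # 2 * pi * t / T over one slot
        v0 = math.cos(offset + depth * math.cos(theta - x))
        re += v0 * math.cos(theta)
        im += v0 * math.sin(theta)
    return 2.0 * math.hypot(re, im) / samples

def best_delay(steps=1000):
    """Delay (as a fraction of T) that maximizes the bit-rate component."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=bit_rate_component)

tau_opt = best_delay()
print(f"tau/T maximizing the bit-rate component of V0: {tau_opt:.3f}")
```

The component vanishes at both ends of the range and peaks near six tenths of a bit interval, in line with the value quoted above.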

The delay, $\tau$, is related to $T$ by another constant. Since we must have
$\omega_0 T = (m + \tfrac{1}{2})\pi$ (see Appendix A of Ref. 5), and we also require
$\omega_0 \tau = m'\pi$ for (4), $\tau$ and $T$ are related by

$$\frac{\tau}{T} = \frac{m'}{m + \tfrac{1}{2}} .$$

Thus, one chooses $m'$ such that $m'/(m + \tfrac{1}{2})$ is as near 0.6 as possible.
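
As a brief illustration of this choice, the sketch below picks the integer $m'$ that brings $m'/(m + \tfrac{1}{2})$ closest to 0.6 for a given $m$. The value $m = 10$ is purely hypothetical and is not taken from the experiment; the function name `choose_m_prime` is likewise an assumption.

```python
def choose_m_prime(m, target=0.6):
    """Return (m_prime, tau_over_T) with m_prime / (m + 1/2) nearest target."""
    denominator = m + 0.5
    # The nearest integer to target * denominator minimizes the ratio error.
    m_prime = max(1, round(target * denominator))
    return m_prime, m_prime / denominator

m_prime, ratio = choose_m_prime(10)   # m = 10 is a hypothetical example
print(m_prime, round(ratio, 4))
```

Because $m'$ must be an integer, the achievable ratio only approximates 0.6, and the approximation improves as $m$ grows.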


The Suppression of Monocularly
Perceivable Symmetry During 
Binocular Fusion 


By BELA JULESZ 
(Manuscript received February 10, 1967) 


Symmetries that we can perceive with one eye can be made to disappear
during binocular fusion; that is, a symmetrical pattern in one of a pair of
stereoscopic images may not be seen when we view the pair stereoscopically.
This phenomenon should not be confused with the classically known
binocular rivalry in which the left and right images cannot be fused and one
of the images is alternately suppressed. The type of suppression phenomenon
reported here is obtained for computer-generated random-dot patterns in
which locally each picture element can be fused in a stable way. The
binocularly suppressed symmetry can be one-, two-, and four-fold, and the
experiments give some insight into the processes underlying the perception
of symmetry. In addition to symmetries, it becomes possible to scramble
text by exhibiting it stereoscopically.


I. BINOCULAR FUSION, RIVALRY, AND A THIRD POSSIBILITY 


Recently, the author added a third possibility of perceptual
response to the class of stereoscopic images traditionally consisting of
binocular fusion and binocular rivalry.¹ In these computer-generated
stereoscopic images the local and global properties are juxtaposed such
that locally each picture element can be stereoscopically fused, causing
the monocularly apparent global symmetry to disappear in the fused
percept.

In the demonstration of Ref. 1, the left stereoscopic image consisted
of randomly selected black and white dots with bilateral (one-fold)
symmetry across the center horizontal axis. The right image was
obtained from the left image by subdividing it into 20 horizontal stripes
(of five picture element width) and shifting every even stripe to the
left and every odd stripe to the right by two picture elements. Such a
stereo image is shown in Fig. 1. When monocularly viewed, the
horizontal bilateral symmetry is apparent in the left image, whereas the
right image seems almost random. When binocularly viewed, the
horizontal stripes are perceived alternately in depth, and if the
symmetric pattern is viewed with the non-dominant eye the bilateral
symmetry is suppressed in the fused percept.*

Fig. 1 — (a) Stereogram which, when monocularly viewed, contains an image of
bilateral symmetry across the horizontal axis. When viewed stereoscopically,
horizontal stripes are perceived in depth, and the symmetry is suppressed.
(b) Stereogram identical to (a) except that the left image is mirror reflected
to permit stereoscopic viewing with the supplied front-surface mirror as
described in the appendix.
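
The construction of such a pair can be sketched in a few lines. This is not the program used for the original figures. The 100 × 100 array size follows from 20 stripes of five picture elements, but the random refilling of the columns vacated by each shift, the numbering of stripes from 1 at the top, and all function names are assumptions made for illustration.

```python
import random

def symmetric_left_image(size=100, seed=0):
    """Random black/white dots mirrored about the central horizontal axis."""
    rng = random.Random(seed)
    top = [[rng.randint(0, 1) for _ in range(size)] for _ in range(size // 2)]
    return top + [row[:] for row in reversed(top)]

def shift_row(row, shift, rng):
    """Shift a row horizontally (negative = left); refill the gap randomly."""
    fill = [rng.randint(0, 1) for _ in range(abs(shift))]
    if shift < 0:
        return row[-shift:] + fill
    return fill + row[:len(row) - shift]

def right_image(left, stripe_height=5, shift=2, seed=1):
    """Shift even stripes left and odd stripes right by `shift` elements."""
    rng = random.Random(seed)
    out = []
    for y, row in enumerate(left):
        stripe = y // stripe_height + 1            # stripes numbered from 1
        direction = -1 if stripe % 2 == 0 else 1   # even: left, odd: right
        out.append(shift_row(row, direction * shift, rng))
    return out

left = symmetric_left_image()
right = right_image(left)
```

Every row of the right image matches its counterpart in the left image except for a horizontal displacement, which is what allows local fusion while the global symmetry is broken.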


* These stereograms can be viewed stereoscopically by using a prism in front
of one eye or other similar stereoscopic device. For those readers who do not
possess such viewing aids, a front-surface mirror is included inside the back cover
of this issue. Since all the stereograms except Figs. 1, 2 and 10 contain an image
with vertical bilateral symmetry, they can be fused either ordinarily or with the
aid of the mirror as described in the appendix. Figs. 1(b), 2(b), and 10(b) cannot
be fused with the aid of a prism but only with a mirror; Figs. 1(a), 2(a), and
10(a) can only be fused with a prism. In order to obtain the described binocular
percepts with mirror viewing and not the reversed depth percepts, the left
images of the stereograms should be viewed with the aid of the mirror by the left
eye and the right images directly by the right eye.

In Ref. 1 it was emphasized that the demonstration was only an
example and others could be devised along these lines. Because of the
theoretical importance of this stimulus class, the present article
demonstrates several new examples. They are more powerful than that of Fig.
1 and help to clarify some factors in symmetry perception.


II. POSSIBLE IMPROVEMENTS OF THE ORIGINAL EXPERIMENT


Although the original demonstration in Fig. 1 served its purpose, it
suffered from some inadequacies. The primary limitation of the
stimulus is our difficulty in perceiving bilateral symmetry along a
horizontal axis. Is it possible to create stereoscopic images in which
more powerful monocular percepts of symmetry are binocularly
suppressed? For instance, can the monocularly apparent bilateral
symmetry across a vertical axis be made to disappear, or what is more, can
we binocularly suppress the strongly perceivable two-fold or four-fold
monocular symmetries?

Another criticism can be raised when inspecting Fig. 1. One might
argue that the right image is not symmetrical, but because of a few
recognizable clusters which are similar in the upper and lower half
fields, the appearance of the right image deviates from complete
randomness. Although we could argue that the bilateral symmetry in
the left field is effortlessly perceived, whereas in the right field we have
to scrutinize the stimulus to detect departure from randomness,
nevertheless, attempts should be made to deal with this criticism.

A third objection might concern the loose way of specifying the
amount of symmetry suppression in the binocular percept. Of course, it
might be a simple task to let subjects rank order the amount of
symmetry in the left and right monocular percepts and in the fused
binocular percept. On the other hand, the phenomena shown here are
universal, i.e., observers with adequate stereoscopic vision deviate
little in their judgments. The skeptic can easily check the validity of
these findings himself. Therefore, instead of quantification, all
experimental efforts have gone into the creation and display of
increasingly sophisticated stimuli. It should be mentioned that after rank
ordering the left and right monocular percepts with respect to
symmetry (or lack of symmetry), the symmetry in the binocular percept
does not have to be the mean of the monocular percepts, but can be
even weaker than that of the monocular percept with the scrambled
symmetry.

In the experiments reported here a systematic attempt was made to 
break up the clusters and strengthen the perception of monocular 
symmetries. Binocular suppression of symmetry was still observed. 


III. NON-TOPOLOGICAL OPERATIONS IN BINOCULAR VISION 


The existence of locally correlated but globally uncorrelated stereo- 
scopic images is based on a fundamental difference between monocular 
and binocular vision. When viewed with one eye, the retinal projec- 
tions of objects usually change size and shape in a continuous manner. 
Exceptions are a few hidden parts which may suddenly enter or depart 
the visual field. Therefore, monocular perception mostly operates on 
continuous (topological) transformations of the stimuli, and pro- 
longed departure from spatial-temporal continuity results in strange 
phenomena and distortions. (For example, imagine a television picture 
out of horizontal synchronism.) On the other hand, binocular vision 
can easily combine non-topologically related stereoscopic images to 
yield stereopsis and fusion. Parts of one stereoscopic image can be 
broken up and shifted horizontally in the other image and, if these 
shifts are kept within the critical limit of disparity, the two images 
will be combined in a single three-dimensional percept. These shearings 
and displacements when skillfully applied to binocular viewing can 
destroy many monocular percepts, particularly that of symmetry. 


IV. THE ROLE OF CLUSTERS IN SYMMETRY PERCEPTION 


The right image of Fig. 1, although not symmetrical, is seen to
deviate from randomness when carefully viewed. One of the main cues
for bilateral symmetry has been removed, namely the large
symmetrically shaped clusters in the immediate neighborhood (within ±5
picture elements) of the horizontal symmetry axis. On the other hand,
within the five picture-element wide horizontal stripes, some
characteristic micropatterns are formed by chance; these can be recognized
and matched in the upper- and lower-half fields. In the binocular
percepts, because each of these similar micropattern pairs is seen at
opposite depth levels, their matching becomes more difficult. Therefore,
the binocular percept seems even more random than the right
monocular view.

In addition to the microclusters formed by chance, the same random 
fluctuations can produce macroclusters which extend to large areas. 
The alternate horizontal shifts by a few picture elements, performed 
on the even and odd stripes are not adequate to break up these large 
dark or bright areas, and these are the last cues left in the binocular 
percept to reveal symmetry. 

Is it possible to reduce the traces of symmetry by breaking up the 
micro- and macro-clusters in the random dot textures? Several at- 
tempts were made to break up the clusters. The first was a preprocess- 
ing of the left field of Fig. 1. Whenever a picture element was sur- 
rounded by more than six picture elements of the same brightness, its 
brightness value was complemented from black to white or vice versa; 
otherwise it was kept unchanged. This intervention prevented the 
formation of black and white clusters within a 3 × 3 array, but long
horizontal and vertical lines were still present and were strongly 
perceivable both in the left and right images. A second cluster-breaking 
operation was therefore performed. If out of seven adjacent picture 
elements which lay on a horizontal or vertical line, six or more were 
white or six or more were black, the center one was complemented to 
opposite brightness value. The outcome of these two cascaded opera- 
tions applied on a random dot pattern (similar to the left field of 
Fig. 1) yielded the left field of Fig. 2. This image looks very intricate, 
and the horizontal bilateral symmetry is mainly apparent because of 
the patterns close to the symmetry axis and because of some long 
diagonal line clusters that remained intact. 
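
The two cascaded cluster-breaking operations can be sketched as follows. This is an interpretation rather than the original program: border elements are left untouched, both passes read from the unmodified input so that symmetry is preserved, and in the second pass the center element is set to the brightness opposite to the majority in its window. These choices, and the function names, are assumptions.

```python
def break_blobs(img):
    """Complement any element with more than six like-valued neighbors (3 x 3)."""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            same = sum(
                img[y + dy][x + dx] == img[y][x]
                for dy in (-1, 0, 1)
                for dx in (-1, 0, 1)
                if (dy, dx) != (0, 0)
            )
            if same > 6:
                out[y][x] = 1 - img[y][x]
    return out

def break_lines(img):
    """Break runs: if six or more of seven collinear elements agree,
    set the center element to the opposite brightness."""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(h):
        for x in range(w):
            for dy, dx in ((0, 1), (1, 0)):      # horizontal, then vertical
                cells = [(y + k * dy, x + k * dx) for k in range(-3, 4)]
                if not all(0 <= r < h and 0 <= c < w for r, c in cells):
                    continue                      # window does not fit
                white = sum(img[r][c] for r, c in cells)
                if white >= 6:
                    out[y][x] = 0
                elif white <= 1:
                    out[y][x] = 1
    return out

def break_clusters(img):
    """Cascade the two operations, as described in the text."""
    return break_lines(break_blobs(img))
```

Applied to a symmetric random-dot array, the cascade removes compact blobs and long straight runs but, as noted above, leaves diagonal structures intact.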

Another way of reducing recognizable micropatterns is to diminish 
the width of the horizontal stripes which are alternately shifted in the 
right image. In Fig. 2 the horizontal stripes in the right image are only 
two picture-elements wide (in contrast to the five picture-element 
stripe width in Fig. 1). Thus, no micropattern larger than two picture 
elements in the vertical extent stays unchanged. The right image of 
Fig. 2 gives a more random impression than the right image of Fig. 1. 
Unfortunately, with smaller stripe width, the amount of disparity has 
to be reduced in order to obtain stable stereopsis. Therefore, in Fig. 2 
the even stripes are shifted to the left by two picture elements while 
the odd stripes are kept unchanged. The total disparity of Fig. 2 is only 
two picture elements, whereas in Fig. 1 it was twice this amount.
Perhaps this precaution is unnecessary, as seen in Figs. 7 and 8, which
contain two picture-element wide stripes and a total disparity of four
picture elements, yet yield good stereopsis. When Fig. 2 is binocularly
viewed, the thin stripes cannot individually be resolved but are
perceived as a transparent textured plane in front of a solid textured
background. In the fused image the symmetry is strongly suppressed,
particularly when the non-dominant eye views the symmetric image.
On the other hand, the reduced disparity makes it easier to find a few
micropatterns in the two depth planes by scrutinizing the stimulus.

Fig. 2 — (a) Stereogram which is similar to Fig. 1 except that the width and
binocular disparity of the stripes is reduced and cluster formation of many
proximate dots of equal brightness is prevented by preprocessing. When viewed
stereoscopically, a transparent textured surface is perceived above the
background. (b) Stereogram identical to (a) except that the left image is
mirror reflected to permit stereoscopic viewing with the supplied front-surface
mirror as described in the appendix.

It should be possible to vary the stripe width in such a way that the 
upper half field contains combinations different from those of the lower 
half field, thus further reducing similar micropattern pairs. Neverthe- 
less, instead of further attempts with horizontal bilateral symmetry, 
we turn our attention to vertical bilateral symmetry. 


V. VERTICAL BILATERAL SYMMETRY 


In these experiments the horizontal bilateral symmetry had two 
disadvantages over the vertical case. The first disadvantage has been 
already discussed; it was pointed out that horizontal symmetry is less 
perceivable than vertical symmetry. The second disadvantage will now 
be discussed. In Figs. 1 and 2 the left and right images contained the 
same picture elements except for a few picture elements that were un- 
correlated in the small areas affected by the horizontal shifts. On the 
other hand, it is known from the random-dot stereoscopic image tech- 
nique that areas presented to only one eye’s view are perceived as 
continuations of the depth plane furthest behind.?® 

This perceptual response permits us to reduce further the similarity 
between the left and right image pairs. For instance, suppose we con- 
sider a left image with vertical bilateral symmetry. We now subdivide 
it into twenty vertical stripes of five picture-element width. The right 
image is identical to the left one, except each even vertical stripe is 
shifted to the left by two picture elements, as though it were a solid 
sheet. Because of the shift, bars two picture-elements wide along the 
left sides of the shifted stripes will be hidden by the dots which belong 
to the vertical stripes. The right sides of the shifted vertical stripes will
uncover new areas, which will be filled in with new random dots. Thus, 
every shifted even stripe will hide a bar (which belongs to the sur- 
round) of two picture-elements width (out of the five), and the odd 
stripes (which belong to the surround) of three picture-element width 
will be extended by a newly generated bar of two picture-elements 
width. Such a stereoscopic image is shown in Fig. 3, in which, in
addition to the shearing and shift, 2/5 = 40 percent of the dots are
different in the right image as compared to the left. When the left image
of Fig. 3 is monocularly viewed, the vertical bilateral symmetry is
immediately apparent; when the images are binocularly viewed,
vertical stripes are seen in front of a background, and symmetry is
suppressed. The only cue for symmetry is the presence of some large
black and white clusters which can be paired in the left and right half
fields.

Fig. 3 — Stereogram which, when monocularly viewed, contains an image of
bilateral symmetry across the vertical axis. When viewed stereoscopically, vertical
stripes are perceived in depth, and the symmetry is suppressed.
Fig. 4 is similar to Fig. 3, except that the same cluster breaking 
operation used in Fig. 2 was applied. Here the binocular percept is 
quite free from symmetry, except for a few diagonal checkerboard-like 
patterns which might be found under scrutiny. Anyway, there is no 
doubt that further preprocessing could greatly reduce these few re- 
maining recognizable structures. It is also true that the bilateral sym- 
metry is less apparent in the left image of Fig. 4 than before cluster 
breaking. It is a delicate operation to find the amount of cluster break- 
ing which still gives a strong percept monocularly but which causes 
the binocular percept to disappear. 


VI. TWO- AND FOUR-FOLD SYMMETRY 


In a next experiment, a left image with two- and four-fold
symmetry, respectively, was generated and a compatible right stereo image
similar to Fig. 4 was devised. The only difference is that the vertical
five picture-element wide alternate stripes are shifted in phase in the
lower half field compared to the upper half field. Whereas in the upper
half field every even vertical stripe is perceived above the background,
in the lower half field every odd vertical stripe is in front. Such stereo
images are presented in Figs. 5 and 6, where the former contains a
two-fold symmetry, whereas the latter a four-fold symmetry. The
middle phase shift prevents the perception of the bilateral symmetry
across the horizontal axis, while the alternate vertical stripes in depth
suppress the symmetry across the vertical axis. The suppression of
binocular symmetry is perhaps less striking for these cases than for the
one-fold symmetry in Fig. 4; on the other hand, the two-fold and
particularly the octal (four-fold) symmetry in the left field is so strong
that the difference between the monocular and binocular perception of
symmetry is very pronounced.

Fig. 4 — Stereogram which is similar to Fig. 3, except that clusters are broken
up by preprocessing.

Fig. 5 — Stereogram which, when monocularly viewed, contains an image of
two-fold symmetry. When viewed stereoscopically, vertical stripes are perceived
in depth which are shifted in position in the lower half field with respect to the
stripes in the upper half field. The symmetry is suppressed in the binocular view.

Fig. 6 — Stereogram which, when monocularly viewed, contains an image of
octal symmetry. Otherwise similar to Fig. 5.

There are many other manipulations one could successfully apply 
in order to break up monocular global percepts. Instead of long stripes, 
rectangles can be selected at various depth levels and various widths. 
The horizontal and vertical stripes at various depth levels can be 
intermixed. The illustrations have served only to show the flexibility of 
the random-dot stereoscopic image technique. 


VII. SOME CUES OF SYMMETRY PERCEPTION 


It was already discussed how clusters affect symmetry perception
and how much stronger bilateral symmetry is perceived across a
vertical axis than across a horizontal one. Now, we are in a position
to examine some of these problems in more detail. Since the images in
these experiments are devoid of all familiarity cues, this preference for
the vertical is characteristic of human perception and is much more
prominent than one would expect from psycho-physiological findings
on the detection of horizontal-vertical lines.*> It is also interesting that 
in two- and four-fold symmetry perception—which are more strongly 
perceived than bilateral vertical symmetry—the strength of perceived 
symmetry across the horizontal axis is greatly increased. 

In the preceding experiments we restricted ourselves to mirror symmetries.
For the horizontal bilateral symmetry $F(x, y) = F(x, -y)$,
and for the vertical bilateral symmetry $F(x, y) = F(-x, y)$ has to be
satisfied. For the four-fold symmetry, in addition to both of these equations,
$F(x, y) = F(-x, -y)$ has to be satisfied as well. Besides mirror
symmetries one can study the perception of centric symmetries, such as
shown in the right image of Fig. 7. For this case $F(x, y) = F(-x, -y)$;
$F(x, y) \neq F(-x, y)$ and $F(x, y) \neq F(x, -y)$. This image was derived
from the octal-symmetrical left image by alternately shifting every 
even horizontal stripe (of two picture-element width) to the right and 
every odd one to the left. The monocularly apparent octal symmetry is 
very strong in the left image, while the centric symmetry is hard to 
perceive in the right one. Nevertheless, some large clusters in the 
corners can be detected in both images, which are also recognizable in 
the stereoscopic view. It is interesting to note that these macroclusters 
are easily detected and matched in spite of their differences in fine 
details. 
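
The symmetry relations above can be checked mechanically on a sampled picture. The following is a minimal sketch, assuming the image is held as a NumPy array whose centre is the coordinate origin, so that reversing x is a left-right flip and reversing y is an up-down flip. The function names are illustrative and are not taken from the original computer program.

```python
import numpy as np

def symmetry_report(img):
    """Report which of the three symmetry relations hold exactly for a 2-D array."""
    img = np.asarray(img)
    rotated = np.flipud(np.fliplr(img))  # samples of F(-x, -y)
    return {
        # F(x, y) = F(x, -y): mirror symmetry across the horizontal axis
        "horizontal_axis": bool(np.array_equal(img, np.flipud(img))),
        # F(x, y) = F(-x, y): mirror symmetry across the vertical axis
        "vertical_axis": bool(np.array_equal(img, np.fliplr(img))),
        # F(x, y) = F(-x, -y): centric symmetry
        "centric": bool(np.array_equal(img, rotated)),
    }

def four_fold_from_quadrant(quadrant):
    """Build a four-fold symmetric image by reflecting one quadrant across both axes."""
    quadrant = np.asarray(quadrant)
    top = np.hstack([quadrant, np.fliplr(quadrant)])
    return np.vstack([top, np.flipud(top)])

rng = np.random.default_rng(0)
random_quadrant = rng.integers(0, 2, size=(8, 8))  # random black/white dots
print(symmetry_report(four_fold_from_quadrant(random_quadrant)))
print(symmetry_report(np.eye(2, dtype=int)))        # centric symmetry only
```

The 2 x 2 identity pattern is the smallest case that satisfies the centric relation while violating both mirror relations.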

Fig. 7—Stereogram which, in the left image contains octal (four-fold) symmetry
and in the right image contains centric symmetry.

Fig. 8 is identical to Fig. 7 except for the clusters, which are broken
up by preprocessing. This experiment shows again that the larger the
clusters are, the less similar they have to be in detail to be detected 
and matched at symmetrical locations. The centric symmetry in Fig. 8 
is even harder to perceive than in Fig. 7. The symmetry in the binocu- 
lar view is also greatly reduced, yet less so than in Fig. 6, since the 
left and right images of Fig. 8 are 100 percent correlated. In Figs. 2, 4, 
and 8, in which the macroclusters have been broken up in the left 
images, the microclusters have to be identical in their minutest details 
to be perceived as symmetrical. This is shown in Fig. 9, in which the 
left image was derived from a random-dot pattern with four-fold symmetry
and then preprocessed by the cluster-cleaner. Obviously, the
quadrants of the picture are not identical in their fine detail (since 
local changes by the cluster-cleaner affect the successive complementa- 
tions) but are similar in their larger features. The right image of 
Fig. 9 was obtained by taking a quadrant of the left image and 
reflecting it across the horizontal and vertical axes sequentially. When 
the images are viewed at a short distance, they appear quite different; 
one looks symmetrical, the other not. When viewed from a distance, 
the two images appear identical and symmetrical, since only the 
similar macroclusters are perceivable. 

This observation, that symmetry perception depends on cluster size,
which in turn depends on texture density, explains some of the difficulties
with these demonstrations as they are presented in this article.
The figures as shown are already too small at the best viewing distance
of 10 inches. If the images are made larger (in excess of 80 degree
visual angle) the stereoscopically fusable local features will dominate
more strongly over the macroclusters.

Fig. 8—Stereogram which is similar to Fig. 7, except the macroclusters are
removed by preprocessing.

Fig. 9— Two images which are similar in their macrostructure, but differ in
their microstructure. When viewed at a short distance only the right image
appears symmetrical, while from an increased viewing distance both appear
symmetrical.

Another observation, closely related to the previous ones, suggests 
a relationship between average cluster size and distance from the sym- 
metry axis. Clusters on both sides of a symmetry axis can be detected 
as being symmetrical only within a certain distance from the axis, 
which is commensurate with the cluster size. Thus, small clusters 
must lie close to the symmetry axis, whereas larger ones may lie proportionally
farther. This explains why the left image of Fig. 9
deviates from symmetry when viewed at a short distance. For large 
stimuli the large clusters cannot be perceived effortlessly, thus the 
nonsymmetrical microclusters dominate. When the stimulus size is 
reduced, the situation reverses; it is now the symmetrical macro- 
clusters which can be effortlessly perceived, and differences in micro- 
clusters pass unnoticed. 

Ernst Mach a century ago studied symmetry perception by using 
amorphic shapes.® His findings are similar to the results obtained by 
using random textures as long as large clusters of dark and light areas 
are contained in these textures. According to these findings vertical 
bilateral symmetry and centric symmetry can be spontaneously perceived.
Horizontal bilateral symmetry is less well perceived. When the
macroclusters are broken up, centric symmetry does not yield spontaneous
perception, whereas the perception of bilateral symmetry is
not impaired.


VIII. SUPPRESSION OF MONOCULAR PERCEPTS IN GENERAL 


Until now the monocular percepts to be suppressed binocularly were 
restricted to global symmetries. Needless to say, many other monocu- 
lar global percepts can be destroyed binocularly. Since symmetry is 
easy to generate, it was emphasized over other perceivable forms. For 
instance, the left random field could contain a written text of black or 
white letters that could be perceived and read with one eye. The right 
image would be totally correlated with the left one, except rows or 
columns would be shifted horizontally in some random way. In the 
resulting binocular percept, the letters would be scrambled both 
because of the horizontal shifts and because of these shifts causing 
various letter-segments to be perceived at different depth. This could 
render the binocular percept unreadable. While in symmetry perception 
there is always a possibility that various amounts of symmetry can be 
traced, for this example a dichotomy can be obtained. Either the 
perceived image contains a readable text, or not. 

One might argue that it is not surprising that a text in one channel
disappears when some “noise” is added to it through the other channel. 
This argument seems convincing, but actually the opposite is true. 
This random pattern is not uncorrelated “noise” which masks the text 
in the other display, but a totally correlated pattern which gives rise to 
stereopsis. It is an interesting paradox that when the two images are 
uncorrelated, binocular rivalry occurs. One might expect for this case 
the largest masking of the text by the competing uncorrelated noise; 
the opposite is in fact the case, since during binocular rivalry the text 
is quite often visible as dominance alternates. For the type of stimuli 
demonstrated here, the local fusion is perfect and stable, therefore, if 
the text is masked in the binocular view, it stays so indefinitely. 

Fig. 10— (a) Stereogram which, when monocularly viewed contains the word
“YES.” When viewed stereoscopically, the letters become fragmented, rendering
the text unreadable. (b) Stereogram identical to (a) except that the left image is
mirror reflected to permit stereoscopic viewing with the supplied front-surface
mirror as described in the appendix.

Fig. 10 shows such a stereoscopic image. The word “YES” is inserted
in the random texture of the left image by a computer program. The
right image is derived from the left image by shifting the horizontal
stripes, each two picture elements wide, alternately: the even stripes
to the left and the odd stripes to the right. When this stereoscopic pair
is presented binocularly to subjects who have not yet seen the images
monocularly, the impression is a transparent textured surface above a textured
background. All eight subjects perceived both surfaces as randomly
textured without being able to find and read the disguised word when 
instructed to search for it. After being instructed to close the right eye, 
each of the subjects recognized the word “YES” in the left image. The 
uniformly dark letters were intentionally sprinkled with random white 


1218 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1967 


dots in order to aid local fusion. If the macroclusters out of which the 
letters are composed are not broken up, it is possible to detect them 
in the random texture of finer grain. This enables the subjects to shift 
the entire letters in the left field in registration with the unshifted 
segments of the same letters in the right image. In Fig. 10 the letter 
“Y” is more robust than the rest, and after it has been detected 
monocularly, some of the subjects could shift the entire letter in 
registration when binocularly viewed. Of course, by covering this letter
with more random texture, it can be made to disappear in the binocular
view like the rest.
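
The construction of the right image from the left one can be sketched in a few lines. This is only an illustration under stated assumptions: the shift amount of one picture element, the zero-based numbering of stripes, and the wrap-around at the image edges are choices made here, since the text does not specify them, and `shift_alternate_stripes` is a hypothetical name.

```python
import numpy as np

def shift_alternate_stripes(left, stripe_height=2, shift=1):
    """Derive a right image by shifting alternate horizontal stripes left and right.

    Even-numbered stripes (counting from zero) move left, odd-numbered stripes
    move right. Dots pushed past one edge re-enter at the other (an assumption).
    """
    left = np.asarray(left)
    right = np.empty_like(left)
    for top in range(0, left.shape[0], stripe_height):
        rows = slice(top, top + stripe_height)
        direction = -1 if (top // stripe_height) % 2 == 0 else 1
        right[rows] = np.roll(left[rows], direction * shift, axis=1)
    return right

rng = np.random.default_rng(1)
left_image = rng.integers(0, 2, size=(64, 64))   # random-dot texture
right_image = shift_alternate_stripes(left_image)
```

Because every row of the right image is a shifted copy of the corresponding row of the left image, the two images remain fully correlated, which is the property the argument in the text relies on.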

These observations suggest a cluster processor prior to stereopsis. 
If clusters of a certain granularity exist in a finer or coarser textured 
surround, they might be extracted and separately fused from the rest. 
When the clusters in the stereoscopic images have the same average 
clustering, this preprocessing becomes inoperative and stereopsis occurs 
on a point-by-point basis. 

There are many other examples in which monocular percepts are 
suppressed in the binocular view. In Refs. 2 and 7 several random-dot 
stereoscopic images have been presented in which one of the images 
was perturbed, yet this monocularly apparent perturbation was sup- 
pressed binocularly. For instance, when one image of the random-dot 
stereoscopic pair is blurred, binocular depth can be still experienced 
and the binocular percept appears as the sharp image.” This suppres- 
sion of the contour-impoverished image by the contour-rich image 
seems to be a general property of binocular combination and was 
recently studied by Levelt.* Although these phenomena corroborate
the basic theme of this article, they differ from the examples given 
here, since they are a special case of binocular rivalry. According to 
Levelt, the greater the difference between contour content in a stereo- 
scopic pair the more the contour-rich channel is weighted. On the 
other hand, in our experiments the stereoscopic pairs are weighted 
equally, and binocular rivalry never occurs—only perfect point-by- 
point fusion. 


IX. CONCLUSIONS 


In 1960 a perceptual phenomenon was reported in this Journal which 
demonstrated that binocular shapes can be perceived from monocularly 
shapeless and contourless, random-dot stereoscopic images.? The find- 
ing that correlated areas in the left and right images could give rise 
to binocular depth perception, regardless of the fact that these areas 
were completely disguised when viewed by one eye, has several
theoretical and practical implications. An obvious implication is that 
no monocular global percept (form recognition) is necessary for 
stereopsis. Recently, a reversal of this phenomenon was demonstrated 
by the author.1 A stereoscopic image was devised in which the monocu- 
larly apparent shapes of bilateral symmetry across a horizontal axis 
disappeared when stereoscopically viewed. This phenomenon sharpens 
the implication of the earlier one. It suggests that whenever binocular 
combination occurs, this process precedes or dominates the recognition 
of horizontal bilateral symmetry. 

The demonstration of Ref. 1 had a few inadequacies which were 
overcome in the experiments presented here. In order to improve on the 
weakly perceived bilateral symmetry across a horizontal axis, ways 
were found in which vertical bilateral symmetry could be presented 
monocularly without affecting symmetry suppression in the binocular 
view. Similar binocular suppression was obtained for monocularly 
apparent two- and four-fold symmetries. Micro- and macro-clusters, 
which are always present in random textures, were broken up in the 
stimulus and the results of this preprocessing on symmetry perception 
were studied. 

An elaboration on these experiments substituted monocularly ap- 
parent symmetry with monocularly readable text to be incorporated 
in the stereoscopic images. In the binocular view the letters of the text 
were perceived as being scrambled and the letter fragments were 
perceived at various depths rendering the text unreadable. 


APPENDIX

A Method of Viewing Stereograms 

Figs. 1(b), 2(b), 3 through 9, and 10(b) may be viewed stereoscopically
with the aid of the front-surface mirror supplied inside the back
cover of this issue. Ordinary (back surface, glass) mirrors cannot be 
used. The mirror is made of polystyrene and is quite fragile—carefully 
remove the mirror, taking care not to scratch or mar the surface. For 
best results, the mirror should be flattened by fastening it to a piece of 
stiff, flat cardboard by means of transparent tape laid along the edges. 

Fig. 11— The author demonstrating a method of viewing stereograms.


Fig. 11 shows a method of viewing the stereograms. To obtain 
stereopsis, hold the mirror facing toward the left image so that the left 
eye views the image reflected in the mirror while the right eye gazes 
directly at the right image, as shown in Fig. 11. The mirror should be 
nearly perpendicular to the image plane. The reflected image will 
appear to float over the directly-viewed image. Adjust your position (or 
the mirror) until the reflected image and directly-viewed image are 
superimposed and the reflected image is the same size as the directly- 
viewed image. At this point the images fuse and give the stereoscopic 
effect. 


Large-Signal Calculations for the Overdriven
Varactor Upper-Sideband Upconverter 
Operating at Maximum Power 
Output 


By J. W. GEWARTOWSKI and R. H. MINETTI 
(Manuscript received March 6, 1967) 


When a varactor frequency upconverter is used as the output device in a 
communications transmitter, it is often desirable to operate at maximum
power output. For such operation, the design procedure includes a large- 
signal analysis under overdriven conditions, requiring the use of a computer 
for solution. 

Equations for the instantaneous varactor charge, current, and voltage 
are derived assuming that only three currents are present: those correspond- 
ing to the signal, pump, and output frequencies. Numerical solutions cor- 
responding to maximum power output are obtained for both graded-junction 
and abrupt-junction varactors. Values of power output, conversion efficiency, 
input impedances, load impedance, and bias voltage are presented for 
ranges of drive level (1 to 2) and varactor quality ($0.001 \le \omega_3/\omega_c \le 0.1$)
sufficient to include most practical designs. 


I. INTRODUCTION 


Varactor upconverters are finding increased application in solid-state 
microwave transmitters. In this application a frequency modulated IF
signal is mixed with an unmodulated microwave signal to obtain a 
frequency modulated output signal.t Because of the varactor’s low loss 
and high power handling capacity, this output signal can be used as the 
transmitter output signal without further amplification. Microwave 
conversion efficiencies in a varactor upconverter are typically greater 
than 50 percent.” 

A varactor upconverter can be either an upper-sideband or a lower- 
sideband upconverter. However, since the lower-sideband upconverter 
can present stability problems,? the upper-sideband upconverter is gen- 
erally preferred for the above application. 

This paper presents a general analysis of lossy varactor upper-
sideband upconverters, using both graded- and abrupt-junction varac- 
tors. Results will be presented for operation at maximum power output 
for a prescribed drive level. The two input signals and the output signal 
are all three “large” signals in this mode of operation. Penfield and 
Rafuse have presented a theory for nonoverdriven abrupt-junction 
varactors;> however, their analysis gives a conservative value of out- 
put power, as explained by Nelson.* Nelson presents an improved 
upper bound on the varactor charge coefficients, which results in 25 to 
54 percent greater output power than that given by Penfield and 
Rafuse. This paper will use Nelson’s method of computing this upper 
bound, with a slight modification to allow for arbitrary output phase 
angles. Grayzel has presented a similar analysis for “punch through” 
varactors.> His analysis differs from ours in that he assumes a particular
phase condition for the output current [equivalent to taking
$\alpha = 0$ in our (5)]. Our results show that as much as 16 percent greater
power output is obtained when $\alpha$ is optimized.


II. ANALYSIS


2.1 Model and Assumptions 


The varactor model chosen for the analysis is the usual one consisting
of a constant resistance $R_s$ in series with a variable capacitance, as
shown in Fig. 1. A polarity is assumed such that when the varactor is
reverse biased, the voltage and charge stored are positive. 

For voltages between the barrier potential and the breakdown voltage,
the voltage and stored charge on the variable capacitance are
related by

$$\frac{v_j - \Phi}{V_B - \Phi} = \left( \frac{q - q_\Phi}{Q_B - q_\Phi} \right)^{1/(1-\gamma)} \tag{1}$$

for

$$\Phi \le v_j \le V_B, \qquad q_\Phi \le q \le Q_B,$$

where

$v_j$ = voltage across the capacitance
$\Phi$ = barrier potential (negative)
$V_B$ = breakdown voltage (positive)
$q$ = charge on the capacitance
$Q_B$ = charge at breakdown voltage (positive)
$q_\Phi$ = charge at the barrier potential (negative).

Fig. 1— Varactor model consisting of a constant resistance in series with a
time-varying capacitance.


When the varactor is overdriven the voltage is assumed to clamp to the
barrier potential

$$v_j = \Phi \quad \text{for} \quad q \le q_\Phi. \tag{2}$$

In this region the charge-storage effect is assumed to act like an infinite 
capacitance, so that any amount of charge can be stored without any 
additional voltage drop. This model is a good approximation for micro- 
wave varactors, where the minority carrier lifetime is considerably 
longer than an RF period. Experimental evidence has shown that this 
is still a reasonable approximation at frequencies as low as 40 MHz, a 
typical IF frequency. 

It is necessary to have a parameter which measures the extent to
which the varactor is overdriven. The drive is defined by®

$$\text{drive} = \frac{Q_B - q_{\min}}{Q_B - q_\Phi}, \tag{3}$$

where it is assumed that the varactor is always driven up to breakdown.
Thus, drive = 1 corresponds to the “fully driven” case of
Penfield and Rafuse.* Most practical high-power varactor devices
operate overdriven, so that drive > 1.


Another useful parameter is the varactor cutoff frequency, given by?

$$f_c = \frac{\omega_c}{2\pi} = \frac{S_{\max}}{2\pi R_s} = \frac{V_B - \Phi}{2\pi R_s (1 - \gamma)(Q_B - q_\Phi)}, \tag{4}$$

where $S_{\max}$ is the elastance of the diode junction at the breakdown
voltage. The last equivalence is obtained from (1) and the relationship
$S = dv/dq$.
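
The varactor model, the drive parameter, and the cutoff frequency defined above translate directly into a few short functions. The sketch below assumes the power-law form of (1) with exponent 1/(1 - gamma), the clamp of (2), and the definitions (3) and (4); all numerical device values are hypothetical and serve only to make the example runnable.

```python
import math
import numpy as np

# Hypothetical device values, chosen only for illustration.
PHI = -0.5         # barrier potential in volts (negative in this sign convention)
V_B = 50.0         # breakdown voltage in volts
Q_PHI = -0.1e-11   # charge at the barrier potential in coulombs
Q_B = 1.0e-11      # charge at breakdown in coulombs

def junction_voltage(q, gamma, phi=PHI, v_b=V_B, q_phi=Q_PHI, q_b=Q_B):
    """Voltage across the variable capacitance, following (1) and the clamp (2)."""
    q = np.asarray(q, dtype=float)
    # Normalized charge; clipping at zero clamps the voltage to phi when q < q_phi.
    x = np.clip((q - q_phi) / (q_b - q_phi), 0.0, None)
    return phi + (v_b - phi) * x ** (1.0 / (1.0 - gamma))

def drive_level(q_min, q_b=Q_B, q_phi=Q_PHI):
    """Drive as defined in (3); a value of 1 is the fully driven case."""
    return (q_b - q_min) / (q_b - q_phi)

def cutoff_frequency(gamma, r_s, phi=PHI, v_b=V_B, q_phi=Q_PHI, q_b=Q_B):
    """Cutoff frequency in hertz from (4), using S_max = dv/dq at breakdown."""
    s_max = (v_b - phi) / ((1.0 - gamma) * (q_b - q_phi))
    return s_max / (2.0 * math.pi * r_s)

print(junction_voltage([2 * Q_PHI, Q_PHI, Q_B], gamma=0.5))
print(drive_level(2 * Q_PHI - Q_B))
print(cutoff_frequency(gamma=0.5, r_s=1.0))
```

Clipping the normalized charge at zero is one simple way to express the infinite-capacitance assumption: any charge below the barrier-potential charge produces no further voltage change.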

It will be assumed that only three currents are present in the varac- 
tor, those corresponding to the signal, pump, and output frequencies. 
When the pump and output frequencies are appreciably different, cur- 
rents at other than the three frequencies mentioned are impeded by 
the selectivity characteristics of the circuits. However, if the pump 
and output frequencies are close together, then it is difficult to inhibit 
currents at other sidebands. The effect of these sidebands on the results
presented here has not been evaluated. 

Circuit losses are not included explicitly. They may be accounted 
for by increasing the value of $R_s$ or by calculating separate input and
output circuit efficiencies, or by a combination of these two approaches. 


2.2 Equations

The charge stored on the variable capacitance may be written as

$$q = Q_0 + 2Q_1 \sin \omega_1 t + 2Q_2 \sin \omega_2 t + 2Q_3 \sin(\omega_3 t + \alpha), \tag{5}$$

where $\omega_1$ and $\omega_2$ correspond to the two input frequencies, $\omega_3 = \omega_1 + \omega_2$
corresponds to the output frequency, and $\omega_2$ is taken to be greater than
$\omega_1$. Since the two input signals at $\omega_1$ and $\omega_2$ are independent, their
phases may be taken arbitrarily as shown. The output phase angle $\alpha$,
on the other hand, must be chosen to correspond to maximum power
output at a prescribed value of drive.

The instantaneous current is obtained from (5) as

$$i = 2\omega_1 Q_1 \cos \omega_1 t + 2\omega_2 Q_2 \cos \omega_2 t + 2\omega_3 Q_3 \cos(\omega_3 t + \alpha). \tag{6}$$

The total voltage $v$ on the varactor is given by

$$v = v_j + R_s i. \tag{7}$$

The following quantities may be obtained from (5), (6), and (7).
The numerical integrations are performed using (1) and (2) for values
of $v_j$.

Input resistance at $\omega_1$:

$$R_1 = R_s + \frac{1}{\omega_1 Q_1 T} \int_0^T v_j \cos \omega_1 t \, dt. \tag{8}$$

Input resistance at $\omega_2$:

$$R_2 = R_s + \frac{1}{\omega_2 Q_2 T} \int_0^T v_j \cos \omega_2 t \, dt. \tag{9}$$

Load resistance at $\omega_3$:

$$R_3 = -R_s - \frac{1}{\omega_3 Q_3 T} \int_0^T v_j \cos(\omega_3 t + \alpha) \, dt. \tag{10}$$

Input elastance at $\omega_1$:

$$S_1 = \frac{1}{Q_1 T} \int_0^T v_j \sin \omega_1 t \, dt. \tag{11}$$

Input elastance at $\omega_2$:

$$S_2 = \frac{1}{Q_2 T} \int_0^T v_j \sin \omega_2 t \, dt. \tag{12}$$

Output elastance at $\omega_3$:

$$S_3 = \frac{1}{Q_3 T} \int_0^T v_j \sin(\omega_3 t + \alpha) \, dt. \tag{13}$$



In general, the integration interval $T$ must be large enough to obtain
the desired degree of accuracy. To facilitate computation, $\omega_2$ and $\omega_3$
are selected to be the $n$th and $(n + 1)$th integral multiples of $\omega_1$. The
integration interval is then equal to $T = 2\pi/\omega_1$. Results for nonintegrally
related frequencies can be obtained from these results by interpolation.
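
The integrals for the resistances and elastances can be evaluated numerically over one period of the lowest frequency when the three frequencies are harmonically related. The following sketch is an illustration, not the authors' program: it assumes the integral forms of (8) through (13), uses a uniform-sample mean as the integration rule, and works in normalized units (barrier-potential charge 0, breakdown charge 1) with hypothetical voltages.

```python
import numpy as np

def junction_voltage(q, gamma, phi, v_b, q_phi, q_b):
    """Varactor characteristic: power law of (1) with the clamp of (2)."""
    x = np.clip((q - q_phi) / (q_b - q_phi), 0.0, None)
    return phi + (v_b - phi) * x ** (1.0 / (1.0 - gamma))

def circuit_quantities(q0, q1, q2, q3, alpha, n, gamma, r_s,
                       phi=-0.5, v_b=50.0, q_phi=0.0, q_b=1.0,
                       w1=1.0, samples=4096):
    """Resistances and elastances of (8)-(13) for the frequency ratio 1-n-(n+1)."""
    w2, w3 = n * w1, (n + 1) * w1
    period = 2.0 * np.pi / w1
    t = np.arange(samples) * period / samples
    # Charge waveform of (5).
    q = (q0 + 2 * q1 * np.sin(w1 * t) + 2 * q2 * np.sin(w2 * t)
         + 2 * q3 * np.sin(w3 * t + alpha))
    v_j = junction_voltage(q, gamma, phi, v_b, q_phi, q_b)

    def average(f):
        # (1/T) times the integral over one period, by the rectangle rule.
        return float(np.mean(f))

    return {
        "R1": r_s + average(v_j * np.cos(w1 * t)) / (w1 * q1),
        "R2": r_s + average(v_j * np.cos(w2 * t)) / (w2 * q2),
        "R3": -r_s - average(v_j * np.cos(w3 * t + alpha)) / (w3 * q3),
        "S1": average(v_j * np.sin(w1 * t)) / q1,
        "S2": average(v_j * np.sin(w2 * t)) / q2,
        "S3": average(v_j * np.sin(w3 * t + alpha)) / q3,
    }

# Abrupt-junction example (gamma = 1/2), not overdriven, frequency ratio 1-7-8.
print(circuit_quantities(0.5, 0.05, 0.05, 0.05, 0.0, n=7, gamma=0.5, r_s=0.0))
```

A useful sanity check is the linear case (gamma = 0 in this parameterization): the three elastances then all equal the slope of the voltage-charge line, and the resistances reduce to the series resistance alone.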

The resistances calculated above allow one to compute the powers at 
the three frequencies, using values of current from (6). 

Input power at $\omega_1$:

$$P_1 = 2\omega_1^2 Q_1^2 R_1. \tag{14}$$

Input power at $\omega_2$:

$$P_2 = 2\omega_2^2 Q_2^2 R_2. \tag{15}$$

Output power at $\omega_3$:

$$P_3 = 2\omega_3^2 Q_3^2 R_3. \tag{16}$$

In most applications $\omega_2$ and $\omega_3$ correspond to microwave frequencies,
and $\omega_1$ corresponds to a lower frequency. Thus, we define the microwave
conversion efficiency by

$$\eta_{23} = \frac{P_3}{P_2} = \frac{\omega_3^2 Q_3^2 R_3}{\omega_2^2 Q_2^2 R_2}. \tag{17}$$

Another useful expression for this quantity is derived in the Appendix
as

$$\eta_{23} = \frac{\omega_3}{\omega_2} \, \frac{R_2 - R_s}{R_2} \, \frac{R_3}{R_3 + R_s}. \tag{18}$$

The upconversion gain is defined by

$$G_{13} = \frac{P_3}{P_1} = \frac{\omega_3^2 Q_3^2 R_3}{\omega_1^2 Q_1^2 R_1}. \tag{19}$$

Another useful expression for this quantity is derived in the Appendix
as

$$G_{13} = \frac{\omega_3}{\omega_1} \, \frac{R_1 - R_s}{R_1} \, \frac{R_3}{R_3 + R_s}. \tag{20}$$

Equations (18) and (20) are easily seen to correspond to the results
given by the Manley-Rowe’ relations as $R_s \to 0$.
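
The approach to the Manley-Rowe limit can be illustrated numerically. The sketch below assumes that the alternative expressions for efficiency and gain take the form of the Manley-Rowe frequency ratio multiplied by an input loss factor (R - R_s)/R and an output loss factor R_3/(R_3 + R_s); the function names and sample values are hypothetical.

```python
def conversion_efficiency(w2, w3, r2, r3, r_s):
    """Microwave conversion efficiency P3/P2 expressed through resistances."""
    return (w3 / w2) * ((r2 - r_s) / r2) * (r3 / (r3 + r_s))

def upconversion_gain(w1, w3, r1, r3, r_s):
    """Upconversion gain P3/P1 expressed through resistances."""
    return (w3 / w1) * ((r1 - r_s) / r1) * (r3 / (r3 + r_s))

# Frequency ratio 1-7-8 with arbitrary illustrative resistances in ohms.
for r_s in (1.0, 0.1, 0.0):
    print(r_s, conversion_efficiency(7.0, 8.0, 10.0, 5.0, r_s),
          upconversion_gain(1.0, 8.0, 4.0, 5.0, r_s))
```

As the series resistance goes to zero, both loss factors tend to unity and the two quantities tend to the frequency ratios 8/7 and 8 for this example.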
One other important parameter is the bias voltage, computed
numerically from the expression

$$V_0 = \frac{1}{T} \int_0^T v_j \, dt. \tag{21}$$



2.3 Selection of the Charge Coefficients and Output Phase Angle 

The output power $P_3$ as given by (16) depends on $Q_3$ and $R_3$. $R_3$
in turn is a function of $Q_0$, $Q_1$, $Q_2$, $Q_3$, and other parameters. These
charge coefficients are selected to give maximum output power at a
prescribed value of drive. Obviously, if the drive level were not restricted,
the output power could increase without limit, since overdrive
of any magnitude is allowed by the theoretical model chosen
for the varactor.

The instantaneous charge is given by (5). At the outset, five independent
variables are unspecified: $Q_0$, $Q_1$, $Q_2$, $Q_3$, and $\alpha$. Two of these
variables may be eliminated by using relationships bounding the maximum
and minimum instantaneous charge. The maximum charge is
given by

$$q_{\max} = Q_B \tag{22}$$

and the minimum charge is obtained from the prescribed drive level
using (3). $Q_B$ and $q_\Phi$ are, of course, known values for a given varactor.

After applying these limits to eliminate two of the variables, the remaining
three variables are varied to find numerically the values corresponding
to maximum output power. In our numerical process $Q_0$
and $Q_1$ are eliminated, and $Q_2$, $Q_3$, and $\alpha$ remain as the independent
variables. Note that once these quantities are selected, all other upconverter
operating parameters can be computed using the equations
in Section 2.2 together with a knowledge of the varactor characteristics.

The values of $q_{\max}$ and $q_{\min}$ discussed above must be selected with
some care. One method of selection would be to take maximum and
minimum values directly from (5). However, this method is inclined
to give different results when the frequencies are harmonically related
than when they are not. To show this, let us compare two instantaneous
charge waveforms of the type of (5), both having the same charge
coefficients, but the first waveform having harmonically related frequencies
and the second not. The first waveform will repeat its pattern
with a period $2\pi/\omega_1$, whereas the second waveform will never repeat.
Evidently the second waveform will generally have a larger peak-to-peak
amplitude, since it is more “random,” i.e., it has a larger assortment
of peaks and valleys. This is especially true when $\omega_2$ is not much
larger than $\omega_1$.

The values of $q_{\max}$ and $q_{\min}$ are computed using Nelson’s approximate
method.* For $\omega_2/\omega_1$ greater than 2 or 3, one may visualize the $\omega_2$ and $\omega_3$
terms of the charge waveform as being a high-frequency signal, amplitude
modulated at $\omega_1$. Thus, one may write (5) as

$$q = Q_0 + 2Q_1 \sin \omega_1 t + R \sin(\omega_2 t + \theta), \tag{23}$$

where

$$R = 2\sqrt{Q_2^2 + Q_3^2 + 2Q_2 Q_3 \cos(\omega_1 t + \alpha)},$$

$$\theta = \tan^{-1}\left[ \frac{Q_3 \sin(\omega_1 t + \alpha)}{Q_2 + Q_3 \cos(\omega_1 t + \alpha)} \right].$$

Let $\omega_1 t_{\max}$ and $\omega_1 t_{\min}$ be the instants corresponding to the maximum
and the minimum values of the functions

$$Q_1 \sin \omega_1 t \pm \sqrt{Q_2^2 + Q_3^2 + 2Q_2 Q_3 \cos(\omega_1 t + \alpha)}.$$

Then, the maximum and minimum values of instantaneous stored
charge are given approximately by

$$q_{\max} = Q_0 + 2Q_1 \sin \omega_1 t_{\max} + 2\sqrt{Q_2^2 + Q_3^2 + 2Q_2 Q_3 \cos(\omega_1 t_{\max} + \alpha)} \tag{24}$$

and

$$q_{\min} = Q_0 + 2Q_1 \sin \omega_1 t_{\min} - 2\sqrt{Q_2^2 + Q_3^2 + 2Q_2 Q_3 \cos(\omega_1 t_{\min} + \alpha)}. \tag{25}$$

Equations (3), (22), (24), and (25) allow one to compute $Q_0$ and $Q_1$,
given the values of $Q_2$, $Q_3$, and $\alpha$ for a given varactor and drive level.
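
The envelope approximation and the elimination of two charge coefficients can be sketched as follows. This is a minimal illustration, not the original program: the extrema are located by sampling one period of the low-frequency phase, the coefficient at the lowest frequency is found by bisection (a numerical choice made here), and the charges are normalized so that the barrier-potential charge is 0 and the breakdown charge is 1.

```python
import numpy as np

def envelope_extrema(q0, q1, q2, q3, alpha, samples=3600):
    """Approximate maximum and minimum charge from the envelope of the two
    high-frequency terms, sampled over one period of the lowest frequency."""
    phase = 2.0 * np.pi * np.arange(samples) / samples   # values of w1 * t
    inside = q2 ** 2 + q3 ** 2 + 2.0 * q2 * q3 * np.cos(phase + alpha)
    envelope = 2.0 * np.sqrt(np.maximum(inside, 0.0))    # guard against rounding
    slow = 2.0 * q1 * np.sin(phase)
    return q0 + np.max(slow + envelope), q0 + np.min(slow - envelope)

def solve_q0_q1(q2, q3, alpha, drive, q_b=1.0, q_phi=0.0, iterations=60):
    """Find Q0 and Q1 so that the peak charge equals q_b and the swing matches
    the prescribed drive."""
    target = drive * (q_b - q_phi)    # required q_max - q_min

    def swing(q1):
        hi, lo = envelope_extrema(0.0, q1, q2, q3, alpha)
        return hi - lo

    if swing(0.0) > target:
        raise ValueError("Q2 and Q3 alone already exceed the allowed charge swing")
    low, high = 0.0, target / 4.0     # swing(q1) >= 4*q1 brackets the root
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if swing(mid) < target:
            low = mid
        else:
            high = mid
    q1 = 0.5 * (low + high)
    q_max, _ = envelope_extrema(0.0, q1, q2, q3, alpha)
    return q_b - q_max, q1            # Q0 places the positive peak at q_b

print(solve_q0_q1(q2=0.05, q3=0.05, alpha=0.4, drive=1.5))
```

Because the envelope bounds the high-frequency terms at every instant, the extrema found this way do not depend on whether the three frequencies happen to be harmonically related, which is the reason the text gives for preferring this method.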


2.4 Calculation Procedure 


The parameters corresponding to maximum output power were obtained
from a digital computer using a computer program based upon
the simplified flow chart of Fig. 2. Trial values of $\alpha$ were selected at 1
degree intervals, and trial values of $Q_2$ and $Q_3$ were selected at intervals
of $0.0005\,(Q_B - q_\Phi)$. These values were found to give results accurate
to within plotting accuracy.


III. RESULTS


As indicated in Fig. 2, the results are a function of four variables, $\gamma$,
drive, $\omega_2/\omega_3$, and $\omega_3/\omega_c$. $\gamma$ determines the varactor type, graded junction
($\gamma = 1/3$) or abrupt junction ($\gamma = 1/2$).

The parameter drive is defined by (3). There is no theoretical upper 
limit for this quantity for the varactor model chosen; we shall arbi- 
trarily take 2 as the upper limit for our calculation. In a practical 
varactor the drive level will be limited by forward bias current due 
to the finite minority carrier lifetime; however, values of 2 are usually 
attainable. 

The parameter $\omega_3/\omega_c$ expresses the effect of the loss $R_s$ for a practical
varactor.

Fig. 2— Simplified flowchart of the computer program. [Flowchart boxes: read in
values of $\gamma$, drive, $\omega_2/\omega_3$, and $\omega_3/\omega_c$; select value of $\alpha$; select values of $Q_2$
and $Q_3$; calculate output power; is output power a maximum for given $\alpha$?; is
output power a maximum?; compute operating parameters; print results.]





OVERDRIVEN VARACTOR UPCONVERTER - 1231 


The frequency ratio $\omega_2/\omega_3$ determines the ratios of the three frequencies,
since $\omega_3 = \omega_1 + \omega_2$. Fortunately, it has been determined that
the results can be normalized in such a way as to make them independent
of this frequency ratio in most practical cases. These normalized
quantities are as follows:


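normalized resistances

$$\frac{R_1 \omega_1}{S_{\max}}, \quad \frac{R_2 \omega_2}{S_{\max}}, \quad \frac{R_3 \omega_3}{S_{\max}},$$

normalized elastances

$$\frac{S_1}{S_{\max}}, \quad \frac{S_2}{S_{\max}}, \quad \frac{S_3}{S_{\max}},$$

normalized powers

$$\frac{P_1 S_{\max}}{(V_B - \Phi)^2 \omega_1}, \quad \frac{P_2 S_{\max}}{(V_B - \Phi)^2 \omega_2}, \quad \frac{P_3 S_{\max}}{(V_B - \Phi)^2 \omega_3},$$

normalized microwave conversion efficiency and upconversion gain

$$\eta_{23} \frac{\omega_2}{\omega_3} \quad \text{and} \quad G_{13} \frac{\omega_1}{\omega_3},$$

normalized bias voltage

$$\frac{V_0 - \Phi}{V_B - \Phi}.$$
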
Results were computed for several frequency ratios. These ratios are
designated by the harmonics present in the charge waveform, (5).
Thus, for example, 1-7-8 denotes the case $\omega_2/\omega_1 = 7$ and $\omega_3/\omega_1 = 8$.
Results were computed and compared for the cases 1-2-3, 1-3-4, 1-4-5,
1-5-6, 1-7-8, and 1-86-87. Fig. 3 shows the dependence of normalized
efficiency $\eta_{23}\omega_2/\omega_3$ on the frequency ratio, for abrupt- and graded-junction
varactors at the two extreme drive levels for $\omega_3/\omega_c = 0.001$. Fig. 4
shows the dependence of the normalized gain $G_{13}\omega_1/\omega_3$. Similar curves
are obtained for the other operating parameters for $\omega_3/\omega_c = 0.001$.
For $\omega_2/\omega_1 \ge 5$ the percent variation in normalized resistance is less
than 2 percent, and the percent variations in normalized elastance and
bias voltage are less than 1 percent. As the loss factor $\omega_3/\omega_c$ is increased,
several of the normalized quantities show invariance as a
function of the frequency ratio $\omega_2/\omega_1$ similar to that described above
for $\omega_3/\omega_c = 0.001$. However, at high values of $\omega_3/\omega_c$ six of these quantities
are no longer invariant with the ratio $\omega_2/\omega_1$. The six quantities
which vary with $\omega_2/\omega_1$ are listed below (with maximum variation in
percent of the 1-7-8 frequency ratio value as computed for $\omega_3/\omega_c = 0.1$):

(i) $R_1\omega_1/S_{\max}$ (26 percent)
(ii) $R_2\omega_2/S_{\max}$ (10 percent)
(iii) $P_1 S_{\max}/[(V_B - \Phi)^2 \omega_1]$ (26 percent)
(iv) $P_2 S_{\max}/[(V_B - \Phi)^2 \omega_2]$ (10 percent)
(v) $\eta_{23}\omega_2/\omega_3$ (9 percent)
(vi) $G_{13}\omega_1/\omega_3$ (85 percent).

Fig. 3— Variation of normalized microwave conversion efficiency as a function
of the three upconverter frequencies. Percent deviations are calculated with respect
to the 1-86-87 case. $\omega_3/\omega_c = 0.001$.


Fortunately, simple correction functions can be applied to obtain the
variation with the ratio $\omega_2/\omega_1$. Hence, all the results to be presented
were computed using the 1-7-8 frequency ratios, and they may be
considered to be applicable for $\omega_2/\omega_1 \ge 5$, except for the six quantities
mentioned under high-loss operation, for which simple corrections are
provided.

The computed values of $\alpha$ for maximum power output are presented
in Fig. 5. The computed values of normalized charge amplitudes for 
maximum power output are presented in Fig. 6 for the two extreme 
drive levels. As in the nonoverdriven case,‘ $Q_2$ and $Q_3$ are computed
to be equal for maximum power output. Using these values and similar
results for other drive levels, the operating characteristics of the
upconverter are computed and presented below.

Fig. 4— Variation of normalized upconversion gain as a function of the three
upconverter frequencies. Percent deviations are calculated with respect to the
1-86-87 case. $\omega_3/\omega_c = 0.001$.

Figs. 7 and 8 show values of maximum output power for abrupt- 
and graded-junction varactors, respectively. It would appear that the 
abrupt-junction varactor has a considerable advantage in power output.
However, if one considers the power-impedance product $P_3R_3$
(using data from Figs. 17 and 18), the difference is smaller; in fact,
for high drive levels the graded-junction varactor has a larger $P_3R_3$
product.








Fig. 5— Computed values of the output current phase angle $\alpha$ for maximum
power output.

Fig. 6— Normalized charge amplitude coefficients for maximum power output.


The results for maximum power output at unity drive agree with
the results of Nelson,‘ despite the fact that Nelson assumed
$\alpha = 0$ for all cases. The effect of a nonzero value of $\alpha$
is most pronounced for highly driven abrupt-junction varactors. For
example, Fig. 7 gives a value of $P_3 S_{\max}/(V_B - \phi)^2\omega_3$
equal to 0.0327 for $\omega_3/\omega_c = 0.001$ and drive = 2.0, which
is 16 percent larger than the value obtained assuming $\alpha$ equal to
zero.

The upconversion gain at maximum power output is presented in
Fig. 9, and the microwave conversion efficiency is plotted in Fig. 10.
The correction factors for high loss and $\omega_3 \ne 8\omega_1$ are
described in the figure captions. For the graded-junction varactor,
these curves show a definite advantage in operating at high drive
levels. The curves are flatter for the abrupt-junction varactor and
show maxima at intermediate drive levels.

Fig. 7 — Maximum power output for abrupt-junction varactors.

Fig. 8 — Maximum power output for graded-junction varactors.

Values of input elastance $S_1$ and $S_2$ are presented in Fig. 11.
These results are essentially independent of the loss parameter
$\omega_3/\omega_c$. Values of the output elastance $S_3$ are presented
in Fig. 12. Since $S_3$ was defined as the quadrature component of
varactor voltage at $\omega_3$, Kirchhoff’s voltage law applied to the
output loop requires that the load inductance present an equal but
opposite reactance, i.e., $L_3 = S_3/\omega_3^2$.

Fig. 9 — Upconversion gain for varactor upconverters operating at
maximum power output. These results become inaccurate for high loss
(high $\omega_3/\omega_c$) and $\omega_3 \ne 8\omega_1$. Accurate values
of gain may be obtained from the values plotted by dividing by the
correction factor

$$1 - \frac{\omega_3 - 8\omega_1}{8\omega_c}\,\frac{S_{\max}}{R_1\omega_1},$$

where $R_1\omega_1/S_{\max}$ is read directly from Fig. 13 or 14.

Fig. 10 — Microwave conversion efficiency for varactor upconverters
operating at maximum power output. These results become inaccurate for
high loss (high $\omega_3/\omega_c$) and $\omega_3 \ne 8\omega_1$.
Accurate values of efficiency may be obtained from the values plotted
by dividing by the correction factor

$$1 + \frac{\omega_3 - 8\omega_1}{8\omega_c}\,\frac{S_{\max}}{R_2\omega_2},$$

where $R_2\omega_2/S_{\max}$ is read directly from Fig. 15 or 16.





Fig. 11 — Input elastance at radian frequencies $\omega_1$ and
$\omega_2$.

Fig. 12 — Output elastance at $\omega_3$. The load inductance is equal
to $S_3/\omega_3^2$.


The input resistances $R_1$ and $R_2$ are presented in Figs. 13 to 16.
The correction factors for high loss and $\omega_3 \ne 8\omega_1$ are
described in the figure captions. The load resistance $R_3$ is plotted
in Figs. 17 and 18. The abrupt-junction case has resistance maxima in
the drive range of 1.4 to 1.6. This correlates with the range for
maximum gain and efficiency as given by Figs. 9 and 10, as we would
expect from (17) and (19). Similarly, the graded-junction case has
maximum values of resistance, gain, and efficiency at a drive of
approximately 1.8.



















Fig. 13 — Input resistance at $\omega_1$ for an abrupt-junction
varactor (plotted against drive). These results become inaccurate for
high loss (high $\omega_3/\omega_c$) and $\omega_3 \ne 8\omega_1$.
Accurate values of $R_1\omega_1/S_{\max}$ may be obtained by
subtracting the quantity

$$\frac{\omega_3 - 8\omega_1}{8\omega_c}$$

from the values plotted.

Fig. 14 — Input resistance at $\omega_1$ for a graded-junction varactor
(plotted against drive). These results become inaccurate for high loss
(high $\omega_3/\omega_c$) and $\omega_3 \ne 8\omega_1$. Accurate values
of $R_1\omega_1/S_{\max}$ may be obtained by subtracting the quantity

$$\frac{\omega_3 - 8\omega_1}{8\omega_c}$$

from the values plotted.


If one considers the effect of the upconverter parameters upon the
instantaneous bandwidth, one finds that large bandwidth requires
large values of $R_i\omega_i/S_i$. Generally, the low-frequency
$\omega_1$ circuit is most crucial in this respect. Computing ratios of
$R_1\omega_1/S_1$ from Figs. 11, 13, and 14, one finds that the
abrupt-junction varactor has a higher ratio at any prescribed drive
level, although the difference becomes small (12 percent) at high drive
levels. At drive = 1, the ratio $R_1\omega_1/S_1$ for the
abrupt-junction varactor is nearly twice that of the graded-junction
varactor.

Fig. 15 — Input resistance at $\omega_2$ for an abrupt-junction
varactor. These results become inaccurate for high loss (high
$\omega_3/\omega_c$) and $\omega_3 \ne 8\omega_1$. Accurate values of
$R_2\omega_2/S_{\max}$ may be obtained by adding the quantity

$$\frac{\omega_3 - 8\omega_1}{8\omega_c}$$

to the values plotted.

Fig. 16 — Input resistance at $\omega_2$ for a graded-junction
varactor. These results become inaccurate for high loss (high
$\omega_3/\omega_c$) and $\omega_3 \ne 8\omega_1$. Accurate values of
$R_2\omega_2/S_{\max}$ may be obtained by adding the quantity

$$\frac{\omega_3 - 8\omega_1}{8\omega_c}$$

to the values plotted.

Fig. 19 gives the dc bias voltage required for maximum output
power.

Fig. 17 — Load resistance for an abrupt-junction varactor.

Fig. 18 — Load resistance for a graded-junction varactor.


IV. CONCLUSIONS 


This analysis provides all the information required to design an
upper-sideband upconverter for maximum power output, for
abrupt-junction and graded-junction varactors. The results are
sufficiently accurate for $\omega_2/\omega_1 \ge 5$. The necessary load
resistance and inductance are obtained from Figs. 12, 17, and 18. The
input impedances at $\omega_1$ and $\omega_2$ are also presented, and
the designer will ordinarily use this information to provide conjugate
matching with the sources at these frequencies. Experimentally, the
proper impedance matching conditions are facilitated by means of Swan’s
small-signal matching technique.®

It has been assumed throughout that currents are present in the 
varactor only at the three frequencies corresponding to the signal, 
pump, and upper-sideband. 

At low drive levels, the abrupt-junction varactor provides both a
higher power-impedance product and also a greater bandwidth than
that of the graded-junction varactor. However, at high drive levels
the difference between the two varactor types is very small.

Fig. 19 — DC bias voltage required for maximum power output.

These results apply only to operation at maximum power output. 
In some applications, the designer is willing to sacrifice some power 
output in order to obtain a higher microwave conversion efficiency.® 
Such operation would require different operating parameters from 
those presented here. 


APPENDIX 


Efficiency and Gain Relations 


The tuned upconverter may be represented by the equivalent circuit 
shown in Fig. 20. In this equivalent circuit the losses are separated 
from the frequency conversion device, and the power ratios for each 
are computed separately. 

The power ratios for the lossless nonlinear capacitance are obtained
from the Manley-Rowe relations:?

$$\sum_{m=0}^{\infty}\sum_{n=-\infty}^{\infty} \frac{m P'_{m,n}}{m\omega_1 + n\omega_2} = 0 \tag{26}$$

$$\sum_{m=-\infty}^{\infty}\sum_{n=0}^{\infty} \frac{n P'_{m,n}}{m\omega_1 + n\omega_2} = 0, \tag{27}$$

where $P'_{m,n}$ is the power into the lossless nonlinear capacitance at
frequency $m\omega_1 + n\omega_2$. Restricting the exchange of power to
the three radian frequencies of interest, $\omega_1$, $\omega_2$, and
$\omega_3 = \omega_1 + \omega_2$, (26) gives

$$\frac{P'_1}{\omega_1} + \frac{P'_3}{\omega_3} = 0 \tag{28}$$

and (27) gives

$$\frac{P'_2}{\omega_2} + \frac{P'_3}{\omega_3} = 0. \tag{29}$$

Fig. 20 — Basic equivalent circuit of the upconverter (central block:
lossless nonlinear reactance). Energy enters the nonlinear reactance at
radian frequencies $\omega_1$ and $\omega_2$ and leaves at $\omega_3$.

In both cases $P'_3$ is negative. If we redefine $P'_3$ as the output
power, the last two equations become

$$\frac{P'_3}{P'_1} = \frac{\omega_3}{\omega_1} \tag{30}$$

$$\frac{P'_3}{P'_2} = \frac{\omega_3}{\omega_2}. \tag{31}$$


In the input circuit at $\omega_2$, a fraction $(R_2 - R_s)/R_2$ of the
input power reaches the lossless nonlinear reactance. A fraction
$R_3/(R_3 + R_s)$ of the converted power reaches the load. Thus, the
microwave conversion efficiency is given by

$$\eta_{23} = \frac{R_2 - R_s}{R_2}\,\frac{P'_3}{P'_2}\,\frac{R_3}{R_3 + R_s}
          = \frac{\omega_3}{\omega_2}\,\frac{R_2 - R_s}{R_2}\,\frac{R_3}{R_3 + R_s}. \tag{32}$$

Similarly, the upconversion gain is given by

$$G_{13} = \frac{\omega_3}{\omega_1}\,\frac{R_1 - R_s}{R_1}\,\frac{R_3}{R_3 + R_s}. \tag{33}$$
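
As a minimal numerical sketch of relations (32) and (33), the two
functions below evaluate the conversion efficiency and the upconversion
gain from the circuit resistances. The function and argument names are
illustrative assumptions, not notation from the paper; the frequencies
only need to be in consistent units, and the gain expression follows the
same pattern as (32) with the $\omega_1$ circuit in place of the
$\omega_2$ circuit.

```python
def conversion_efficiency(w2, w3, R2, R3, Rs):
    """Eq. (32): Manley-Rowe ratio w3/w2 reduced by input and output loss factors."""
    input_fraction = (R2 - Rs) / R2      # share of pump power reaching the reactance
    output_fraction = R3 / (R3 + Rs)     # share of converted power reaching the load
    return (w3 / w2) * input_fraction * output_fraction


def upconversion_gain(w1, w3, R1, R3, Rs):
    """Eq. (33): same structure as (32), referred to the signal circuit at w1."""
    input_fraction = (R1 - Rs) / R1
    output_fraction = R3 / (R3 + Rs)
    return (w3 / w1) * input_fraction * output_fraction


if __name__ == "__main__":
    # 1-7-8 frequency ratios with hypothetical resistance values.
    print(conversion_efficiency(w2=7.0, w3=8.0, R2=2.0, R3=3.0, Rs=1.0))
    print(upconversion_gain(w1=1.0, w3=8.0, R1=2.0, R3=3.0, Rs=1.0))
```

With $R_s = 0$ both loss factors equal one, and the functions reduce to
the lossless ratios (31) and (30).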
Two Theorems on the Accuracy of
Numerical Solutions of Systems of 
Ordinary Differential Equations 


By I. W. SANDBERG 
(Manuscript received March 138, 1967) 


We consider the accuracy with which a numerical solution of the system of
ordinary differential equations

$$\dot{x} + f(x, t) = 0, \qquad t \ge 0$$

can be obtained by the use of a numerical integration formula of the
well-known type

$$y_{n+1} = \sum_{k=0}^{p} a_k y_{n-k} + h \sum_{k=-1}^{p} b_k y'_{n-k}.$$

For the scalar case, under some natural assumptions, and assuming that
$\alpha$ and $\beta$ are real constants such that

$$\alpha \le \frac{\partial f(x, t)}{\partial x} \le \beta, \qquad t \ge 0$$

at every point $x$, it is proved that if

$$F(z) \triangleq 1 - \sum_{k=0}^{p} a_k z^{-(k+1)} + \tfrac{1}{2}(\alpha + \beta)h \sum_{k=-1}^{p} b_k z^{-(k+1)} \ne 0$$

for all $|z| \ge 1$, then $\langle e \rangle$, the root-mean-squared
error over a given interval, between the true samples of $x(t)$ and the
$y_n$, satisfies

$$\langle e \rangle \ge (1 + \rho)^{-1} \min_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-1} \langle \varphi \rangle$$

in which $\rho$ depends on $\alpha$, $\beta$, the $a_k$, and the $b_k$,
and $\langle \varphi \rangle$ takes into account the local roundoff and
truncation errors as well as errors in the starting values for computing
the $y_n$.

If the condition on $F(z)$ stated above holds, and if $\rho < 1$, then

$$\langle e \rangle \le (1 - \rho)^{-1} \max_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-1} \langle \varphi \rangle.$$

The significance of the key assumptions is discussed and two examples
are given.


I. INTRODUCTION 


In this paper we present some results concerning the accuracy with
which a numerical solution of the system of ordinary differential
equations

$$\dot{x} + f(x, t) = 0, \qquad t \ge 0 \qquad [x(0) = x_0] \tag{1}$$

can be obtained by the use of a numerical integration formula of the
well-known type?

$$y_{n+1} = \sum_{k=0}^{p} a_k y_{n-k} + h \sum_{k=-1}^{p} b_k y'_{n-k}, \qquad n \ge p. \tag{2}$$

In (2) the $y_n$ are approximations to the $x_n \triangleq x(nh)$,
where $h$, a positive number, is the step size parameter;
$y_0, y_1, \ldots, y_p$ are starting vectors, the last $p$ of which are
obtained by an independent method; and

$$y'_n \triangleq -f(y_n, nh).$$

If $b_{-1} \ne 0$, then $y_{n+1}$ is defined implicitly, and (2) is said
to be of closed type. It is assumed throughout that (2) can be solved*
for $y_{n+1}$ for all $n \ge p$. Specializations of (2) include, for
example, Euler’s method:

$$y_{n+1} = y_n + h y'_n,$$

and the more useful formula

$$y_{n+1} = y_n + \tfrac{1}{2}h(y'_n + y'_{n+1}).$$

It is assumed throughout that for $t \ge 0$, $f(x, t)$ is a well-defined
real N-vector-valued function defined in the set of all real N-vectors
$x$, that $f(x, t)$ satisfies (the usual weak) conditions which
guarantee the existence and uniqueness of a solution of (1), and that
the Jacobian matrix $\partial f(x, t)/\partial x$ exists for all $x$ and
all $t \ge 0$.
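
The two specializations of (2) named above, Euler's method and the
closed formula $y_{n+1} = y_n + \tfrac{1}{2}h(y'_n + y'_{n+1})$, can be
sketched for the scalar case as follows. This is only an illustration
under stated assumptions: the function names are hypothetical, the
closed formula is solved by a plain fixed-point iteration started from
an Euler predictor, and the iteration count is an arbitrary choice that
is adequate only when $h$ times the Lipschitz constant of $f$ is small.

```python
def euler(f, x0, h, steps):
    """Open formula: y_{n+1} = y_n + h*y'_n, with y'_n = -f(y_n, n*h)."""
    y = [x0]
    for n in range(steps):
        y.append(y[n] + h * (-f(y[n], n * h)))
    return y


def trapezoidal(f, x0, h, steps, iters=50):
    """Closed formula: y_{n+1} = y_n + (h/2)*(y'_n + y'_{n+1}).

    y_{n+1} appears on both sides, so it is found by fixed-point
    iteration (an assumption of this sketch, not the paper's procedure).
    """
    y = [x0]
    for n in range(steps):
        d_n = -f(y[n], n * h)
        guess = y[n] + h * d_n                      # Euler predictor
        for _ in range(iters):                      # corrector iterations
            guess = y[n] + 0.5 * h * (d_n - f(guess, (n + 1) * h))
        y.append(guess)
    return y


if __name__ == "__main__":
    import math
    f = lambda x, t: x                              # x' + x = 0, x(t) = exp(-t)
    print(euler(f, 1.0, 0.1, 10)[-1], trapezoidal(f, 1.0, 0.1, 10)[-1], math.exp(-1.0))
```

For this linear test problem the Euler iterate is $(1-h)^n$ and the
closed-formula iterate is $[(1-h/2)/(1+h/2)]^n$, which makes the
difference in accuracy between the two formulas easy to inspect.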

Equation (2) ignores the roundoff error $R_n$ introduced in calculating
$y_{n+1}$, and, in order to take $R_n$ into account, we shall consider
instead of (2):

$$y_{n+1} = \sum_{k=0}^{p} a_k y_{n-k} + h \sum_{k=-1}^{p} b_k y'_{n-k} + R_n, \qquad n \ge p. \tag{3}$$

We may also assume that $R_n$ takes into account the error in solving
(2) for $y_{n+1}$, caused typically by truncating an iteration procedure
after a finite number of steps. Predictor-corrector techniques can of
course be viewed as giving rise to a degenerate (often one-step)
iteration technique in which the “starting point” is generated by the
predictor.

* It is well known that if $f$ satisfies a uniform Lipschitz condition,
and if $h$ is sufficiently small, then (2) possesses a unique solution
$y_{n+1}$ which can be obtained by a simple iterative process.’? However,
this smallness condition is by no means always necessary. For example,
if $b_{-1} > 0$ and, with $\alpha$ as defined in Section 2.3, if
$\alpha > 0$, then for any $h > 0$ a unique solution $y_{n+1}$ exists and
can be computed by an iterative process which is only slightly more
complicated than the usual one (see Section IV).

The truncation error $T_n$, a basic entity associated with the
integration formula (2) and the differential equation (1), is defined
for $n \ge p$ by the relation

$$x_{n+1} = \sum_{k=0}^{p} a_k x_{n-k} + h \sum_{k=-1}^{p} b_k x'_{n-k} + T_n, \qquad n \ge p \tag{4}$$

in which $x'_n = -f(x_n, nh)$. Clearly, $T_n$ is a measure of how well
the samples $x_{n-p}, x_{n-p+1}, \ldots, x_{n+1}$ of the solution of (1)
satisfy the integration formula. The problem of estimating $T_n$ is a
classical one, and there are standard methods which lead to precise
bounds.*””

We now define a set of vectors $\{\varphi_n\}$ which plays a central
role in the subsequent discussion:

$$\varphi_n \triangleq T_n - R_n, \qquad n \ge p \tag{5}$$

$$\varphi_n \triangleq (x_{n+1} - y_{n+1}) - \sum_{k=0}^{p} a_k (x_{n-k} - y_{n-k})
  + h \sum_{k=-1}^{p} b_k \{ f[x_{n-k}, (n-k)h] - f[y_{n-k}, (n-k)h] \},
  \qquad n = -1, 0, \ldots, (p-1) \tag{6}$$

(with the understanding that $x_n = y_n = f(x_n, nh) = f(y_n, nh) = 0$
for $n < 0$). Note that the $\varphi_n$ for $n = -1, 0, \ldots, (p-1)$
are measures of the departures of the starting vectors from the exact
values, and that $\varphi_n = 0$ for $n = -1, 0, \ldots, (p-1)$ if the
starting vectors are exact.

We shall describe our results first for the scalar case (i.e., for
$N = 1$).
II. RESULTS


Let $e_n$ denote $(x_n - y_n)$, the difference between $x(nh)$ and its
computed approximation. Suppose that $N = 1$, and that $\alpha$ and
$\beta$ are real constants such that

$$\alpha \le \frac{\partial f(x, t)}{\partial x} \le \beta \tag{7}$$

for all $t \ge 0$ (at every point $x$), and that

$$F(z) \triangleq 1 - \sum_{k=0}^{p} a_k z^{-(k+1)} + \tfrac{1}{2}(\alpha + \beta)h \sum_{k=-1}^{p} b_k z^{-(k+1)} \ne 0 \tag{8}$$

for all $|z| \ge 1$ (including “$z = \infty$”). We prove that then

$$\langle e \rangle \triangleq \Big( (M+1)^{-1} \sum_{n=0}^{M} |e_n|^2 \Big)^{1/2}, \tag{9}$$

the root-mean-squared value of the first $(M + 1)$ error terms [$M$ is
an arbitrary positive integer greater than or equal to $(p + 1)$], is
bounded from below in terms of

$$\langle \varphi \rangle \triangleq \Big( (M+1)^{-1} \sum_{n=0}^{M} |\varphi_{n-1}|^2 \Big)^{1/2}, \tag{10}$$

the corresponding quantity for the $\varphi$’s, in accordance with the
inequality

$$\langle e \rangle \ge (1 + \rho)^{-1} \min_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-1} \langle \varphi \rangle \tag{11}$$

[$F(z)$ is defined in (8)], in which

$$\rho \triangleq \tfrac{1}{2}(\beta - \alpha)h \max_{0 \le \omega \le 2\pi}
  \left| \frac{\sum_{k=-1}^{p} b_k e^{-i(k+1)\omega}}{F(e^{i\omega})} \right|. \tag{12}$$

We also prove that if in addition to the assumptions stated above,
we have $\rho < 1$, then the sequence $\{e_n\}$ is bounded (i.e., there
exists a positive constant $c$ such that $|e_n| \le c$ for all
$n \ge 0$) whenever the sequence $\{\varphi_{n-1}\}$ is bounded, and

$$\langle e \rangle \le (1 - \rho)^{-1} \max_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-1} \langle \varphi \rangle \tag{13}$$

(whether or not $\{\varphi_{n-1}\}$ is bounded).

Inequality (11) provides a limitation on obtainable accuracy under
essentially only the weak assumption that the sequence of
approximations $\{y_n\}$ defined by (2) approaches zero as
$n \to \infty$ for all sets of starting values when
$f(x, t) = \tfrac{1}{2}(\alpha + \beta)x$. Since, by assumption,
$F(e^{i\omega}) \ne 0$ for $0 \le \omega \le 2\pi$, it is clear that
$\rho < \infty$.
The condition that $\rho < 1$ is satisfied if and only if the locus of

$$\Theta(\omega) \triangleq \frac{\sum_{k=0}^{p} a_k e^{-i(k+1)\omega} - 1}{\sum_{k=-1}^{p} b_k e^{-i(k+1)\omega}} \tag{14}$$

for $0 \le \omega \le 2\pi$ lies outside the “critical circle” $C$ of
radius $\tfrac{1}{2}(\beta - \alpha)h$ centered in the complex plane at
$[\tfrac{1}{2}(\alpha + \beta)h, 0]$ (see Fig. 1).*

Fig. 1 — Location of the critical circle $C$ (for $N = 1$).

Since

$$\rho = \tfrac{1}{2}(\beta - \alpha)h \Big\{ \min_{0 \le \omega \le 2\pi} \big| \Theta(\omega) - \tfrac{1}{2}(\alpha + \beta)h \big| \Big\}^{-1}, \tag{15}$$

we see that $\rho$ is the ratio of the radius of $C$ to the distance
between $c$ and $\theta$, where $c$ is the center of $C$ and $\theta$ is
a point nearest $c$ on the locus of $\Theta(\omega)$.

2.1 Discussion 


The quantity $\langle e \rangle$ is of course of interest in problems in
which we are concerned with a measure of the accuracy of a numerical
solution over a large number of steps, as opposed to the accuracy of
some final value obtained at the end of a large number of steps.

* If $\alpha > 0$, we can express both the conditions that $\rho < 1$
and $F(z) \ne 0$ for $|z| \ge 1$ entirely in terms of a condition on the
locus of $\Theta(\omega)^{-1}$ for $\omega \in [0, 2\pi]$. The
requirements on $F(z)$ and $\rho$ are met if the disk of radius
$\tfrac{1}{2}[(\alpha h)^{-1} - (\beta h)^{-1}]$ centered at
$\{\tfrac{1}{2}[(\alpha h)^{-1} + (\beta h)^{-1}], 0\}$ is not
“encircled” or intersected by the locus of $\Theta(\omega)^{-1}$. There
is a complication that arises as a result of the fact that
$\Theta(\omega)^{-1}$ is typically not bounded. This complication often
stems from a “consistency requirement” which implies that
$1 - \sum_{k=0}^{p} a_k z^{-(k+1)}$ has at least one zero on the unit
circle. However, due to also typical “convergence requirements,”
$1 - \sum_{k=0}^{p} a_k z^{-(k+1)}$ normally has only simple zeros on the
unit circle, a fact that can be used to suitably define what is meant by
the locus of $\Theta(\omega)^{-1}$ not encircling the disk. We leave the
details of the necessary “principle of the argument” argument to the
sufficiently interested reader.

Although there is a vast and interesting literature concerned with
various aspects of the problem of error estimation in digital
computation (see, for example, Refs. 3, 4, and 5), the results presented
above, and their proofs, appear to be most closely related to earlier
results concerning the input-output stability of continuous-time
nonlinear feedback systems.** Indeed, the writer is not aware of any
lower-bound results in the numerical analysis literature of the type
described above. However, some upper bounds concerning (2) of (for
example) the form $|e_n| \le K$ with $K$ independent of $n$ (which imply
$\langle e \rangle \le K$) have been obtained in certain cases. In this
connection, our condition that guarantees the boundedness of $\{e_n\}$
is often weaker, and our upper bounds on $\langle e \rangle$ are often
much stronger, because, for example, the $\varphi_{n-1}$ can become very
small as $n$ becomes large.

Our approach can be applied to several other problems in numerical 
analysis. In particular, with reasonably direct modifications of our 
proofs, analogous theorems can be proved concerning the numerical 
integration of systems of second-order ordinary differential equations. 


2.2 Examples 
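Euler’s Method: $y_{n+1} = y_n + h y'_n$


Here $F(z) = 1 - [1 - \tfrac{1}{2}(\alpha + \beta)h] z^{-1}$, so that
$F(z) \ne 0$ for $|z| \ge 1$ if and only if
$0 < \tfrac{1}{2}(\alpha + \beta)h < 2$, and
$|F(e^{i\omega})| = |1 - [1 - \tfrac{1}{2}(\alpha + \beta)h] e^{-i\omega}|$.
Thus (with $0 < \tfrac{1}{2}(\alpha + \beta)h < 2$),

$$\min_{\omega} |F(e^{i\omega})|^{-1} = [1 + |1 - \tfrac{1}{2}(\alpha + \beta)h|]^{-1}$$

$$\max_{\omega} |F(e^{i\omega})|^{-1} = [1 - |1 - \tfrac{1}{2}(\alpha + \beta)h|]^{-1}.$$

The locus of $\Theta$ is the circle shown in Fig. 2, since
$\Theta(\omega) = 1 - e^{i\omega}$. If $\alpha h > 0$ and $\beta h < 2$,
then the critical disk (Fig. 2) is not intersected by the locus of
$\Theta$, the condition that $0 < \tfrac{1}{2}(\alpha + \beta)h < 2$ is
satisfied, $\rho < 1$, and in accordance with the last paragraph of the
section preceding Section 2.1:

$$\rho = \tfrac{1}{2}(\beta - \alpha)h \max\big( [\tfrac{1}{2}(\beta + \alpha)h]^{-1},\ [2 - \tfrac{1}{2}(\beta + \alpha)h]^{-1} \big).$$

* It is interesting to note that the possibility of exploiting
feedback-theoretic ideas has been emphasized by Hamming?

Fig. 2 — The locus of $\Theta(\omega)$ for Euler’s method, and the
critical circle $C$.

If $0 < (\alpha + \beta)h \le 2$, then

$$\min_{\omega} |F(e^{i\omega})|^{-1} = [2 - \tfrac{1}{2}(\alpha + \beta)h]^{-1},$$

$$\max_{\omega} |F(e^{i\omega})|^{-1} = [\tfrac{1}{2}(\alpha + \beta)h]^{-1},$$

$$\rho = \frac{\beta - \alpha}{\beta + \alpha},$$

and

$$\langle e \rangle \ge [1 + (\beta - \alpha)(\beta + \alpha)^{-1}]^{-1} [2 - \tfrac{1}{2}(\alpha + \beta)h]^{-1} \langle \varphi \rangle$$

$$\langle e \rangle \le [1 - (\beta - \alpha)(\beta + \alpha)^{-1}]^{-1} [\tfrac{1}{2}(\alpha + \beta)h]^{-1} \langle \varphi \rangle.$$

For estimates of $T_n$, see Ref. 1 or 2.

As a remark concerning the necessity of the condition $\rho < 1$, we
note that if $\alpha h > 0$, but $\beta h > 2$, then for even the special
case in which $f(x, t) = \beta x$, we have $e_0, e_1, e_2, \ldots$
unbounded, since $y_0, y_1, y_2, \ldots$ is unbounded (assuming merely
that $y_0 \ne 0$).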
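
The closed-form quantities for Euler's method can be compared against a
direct sampling of $|F(e^{i\omega})|$ on the unit circle. The following
is a small sketch under stated assumptions: the function names and the
sample count are illustrative, and the closed forms used are
$\min|F|^{-1} = [1 + |1 - c|]^{-1}$, $\max|F|^{-1} = [1 - |1 - c|]^{-1}$,
and $\rho = \tfrac{1}{2}(\beta-\alpha)h\max(c^{-1}, (2-c)^{-1})$ with
$c = \tfrac{1}{2}(\alpha+\beta)h$.

```python
import cmath
import math


def euler_closed_form(alpha, beta, h):
    """Closed-form min|F|^-1, max|F|^-1 and rho for Euler's method."""
    c = 0.5 * (alpha + beta) * h
    if not 0.0 < c < 2.0:
        raise ValueError("requires 0 < (alpha + beta) * h / 2 < 2")
    min_inv = 1.0 / (1.0 + abs(1.0 - c))
    max_inv = 1.0 / (1.0 - abs(1.0 - c))
    rho = 0.5 * (beta - alpha) * h * max(1.0 / c, 1.0 / (2.0 - c))
    return min_inv, max_inv, rho


def euler_sampled(alpha, beta, h, samples=2000):
    """Sample 1/|F(e^{iw})| with F(z) = 1 - (1 - c)/z on a uniform grid."""
    c = 0.5 * (alpha + beta) * h
    inv = []
    for j in range(samples):
        w = 2.0 * math.pi * j / samples
        inv.append(1.0 / abs(1.0 - (1.0 - c) * cmath.exp(-1j * w)))
    return min(inv), max(inv)


if __name__ == "__main__":
    print(euler_closed_form(0.5, 1.5, 0.5))
    print(euler_sampled(0.5, 1.5, 0.5))
```

An even sample count places grid points at $\omega = 0$ and
$\omega = \pi$, where the extreme values of $|F|$ occur for this
formula.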


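The Formula $y_{n+1} = y_n + \tfrac{1}{2}h(y'_n + y'_{n+1})$:

In this important case

$$F(z) = 1 + \tfrac{1}{4}(\alpha + \beta)h - [1 - \tfrac{1}{4}(\alpha + \beta)h] z^{-1},$$

and

$$\Theta(\omega) = \frac{2(1 - e^{i\omega})}{1 + e^{i\omega}} = -2i \tan\Big(\frac{\omega}{2}\Big).$$

We have $F(z) \ne 0$ for $|z| \ge 1$ if and only if
$(\alpha + \beta)h > 0$, and [assuming that $(\alpha + \beta)h > 0$]:

$$\min_{\omega} |F(e^{i\omega})|^{-1} = [1 + \tfrac{1}{4}(\alpha + \beta)h + |1 - \tfrac{1}{4}(\alpha + \beta)h|]^{-1}$$

$$\max_{\omega} |F(e^{i\omega})|^{-1} = [1 + \tfrac{1}{4}(\alpha + \beta)h - |1 - \tfrac{1}{4}(\alpha + \beta)h|]^{-1}.$$

The locus of $\Theta$ lies entirely on the imaginary axis of the complex
plane,

$$\rho = \frac{\beta - \alpha}{\beta + \alpha},$$

and obviously $\rho < 1$ if $\alpha > 0$.

If $\alpha > 0$ and $(\alpha + \beta)h < 4$, then

$$\min_{\omega} |F(e^{i\omega})|^{-1} = \tfrac{1}{2}$$

$$\max_{\omega} |F(e^{i\omega})|^{-1} = [\tfrac{1}{2}(\alpha + \beta)h]^{-1}$$

and

$$\langle e \rangle \ge [1 + (\beta - \alpha)(\beta + \alpha)^{-1}]^{-1}\, \tfrac{1}{2} \langle \varphi \rangle$$

$$\langle e \rangle \le [1 - (\beta - \alpha)(\beta + \alpha)^{-1}]^{-1} [\tfrac{1}{2}(\alpha + \beta)h]^{-1} \langle \varphi \rangle.$$

The last inequality can be written as simply

$$\langle e \rangle \le (\alpha h)^{-1} \langle \varphi \rangle.$$

For the integration formula under consideration,

$$T_n = -\frac{h^3 x'''(\eta_n)}{12}$$

where $\eta_n$ lies in the interval $[(n - p)h, (n + 1)h]$.
Here $p = 0$, and

$$\varphi_{-1} = (x_0 - y_0) + \tfrac{1}{2}h[f(x_0, 0) - f(y_0, 0)].$$

Thus, assuming for the purpose of illustration that roundoff errors can
be neglected:

$$\langle e \rangle \le (\alpha h)^{-1} \Big( (M+1)^{-1} \Big[ |\varphi_{-1}|^2 + \sum_{n=1}^{M} \Big| \frac{h^3 x'''(\eta_{n-1})}{12} \Big|^2 \Big] \Big)^{1/2}$$
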

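provided that $\alpha > 0$ and $(\alpha + \beta)h < 4$. If, for example,
$(\beta - \alpha)(\beta + \alpha)^{-1} = \tfrac{1}{2}$ and
$\tfrac{1}{4}(\alpha + \beta)h = \tfrac{1}{6}$, then the ratio of our
upper bound on $\langle e \rangle$ to our lower bound is 18. If
$(\beta - \alpha)(\beta + \alpha)^{-1} = \tfrac{3}{4}$ and
$\tfrac{1}{4}(\alpha + \beta)h = \tfrac{1}{6}$, then the ratio is 42.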
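
The ratio of the upper bound to the lower bound for this formula follows
from the expressions for the closed formula given earlier in this
example: with $\rho = (\beta-\alpha)/(\beta+\alpha)$ and
$c' = \tfrac{1}{4}(\alpha+\beta)h < 1$, the ratio is
$(1+\rho)/[(1-\rho)c']$. The sketch below evaluates it. The function
name is hypothetical, and the parameter values $\rho = 1/2$, $\rho = 3/4$
and $c' = 1/6$ are assumptions chosen because they reproduce the quoted
ratios of 18 and 42; other combinations would also give those ratios.

```python
def bound_ratio(rho, c_prime):
    """Upper-to-lower bound ratio for y_{n+1} = y_n + (h/2)(y'_n + y'_{n+1}).

    rho      = (beta - alpha) / (beta + alpha), must satisfy 0 <= rho < 1
    c_prime  = (alpha + beta) * h / 4,          must satisfy 0 < c_prime < 1
    """
    if not (0.0 <= rho < 1.0 and 0.0 < c_prime < 1.0):
        raise ValueError("rho must lie in [0, 1) and c_prime in (0, 1)")
    lower = 0.5 / (1.0 + rho)                  # coefficient of <phi> in the lower bound
    upper = 1.0 / ((1.0 - rho) * 2.0 * c_prime)  # coefficient of <phi> in the upper bound
    return upper / lower


if __name__ == "__main__":
    print(bound_ratio(0.5, 1.0 / 6.0))
    print(bound_ratio(0.75, 1.0 / 6.0))
```

The ratio grows both as $\rho$ approaches one (a wide spread between
$\alpha$ and $\beta$) and as the step size $h$ shrinks.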
2.3 Results for the Vector Case ($N \ge 1$)


We shall state our results for $N \ge 1$ in a slightly more formal
fashion.

Definitions:

(i) Let $\|q\|$ denote $(\sum_{j=1}^{N} q_j^2)^{1/2}$ for every real
N-vector $q = (q_1, q_2, \ldots, q_N)$.

(ii) Let $\{\partial f(x, t)/\partial x\}_S$ and
$\{\partial f(x, t)/\partial x\}_A$ denote, respectively, the symmetric
and antisymmetric (i.e., skew symmetric) part of
$\partial f(x, t)/\partial x$, the Jacobian matrix of $f(x, t)$.

(iii) Let
$F(z) = 1 - \sum_{k=0}^{p} a_k z^{-(k+1)} + \tfrac{1}{2}(\alpha + \beta)h \sum_{k=-1}^{p} b_k z^{-(k+1)}$.

(iv) $e_n = x(nh) - y_n$, $n \ge 0$, with the $y_n$ for $n \ge (p + 1)$
defined by (3).

(v) With $M$ an arbitrary positive integer such that $M \ge (p + 1)$,
let

$$\langle e \rangle \triangleq \Big( (M+1)^{-1} \sum_{n=0}^{M} \|e_n\|^2 \Big)^{1/2}$$

$$\langle \varphi \rangle \triangleq \Big( (M+1)^{-1} \sum_{n=0}^{M} \|\varphi_{n-1}\|^2 \Big)^{1/2},$$

where the $\varphi_{n-1}$ are defined in (5) and (6).


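Assumptions:


Let the smallest eigenvalue of $\{\partial f(x, t)/\partial x\}_S$ be
bounded from below by the real constant $\alpha$ ($\alpha > -\infty$)
for all $t \ge 0$, and let the largest eigenvalue of
$\{\partial f(x, t)/\partial x\}_S$ be bounded from above by the real
constant $\beta$ ($\beta < \infty$) for all $t \ge 0$. Let the modulus
of the largest eigenvalue of $\{\partial f(x, t)/\partial x\}_A$ be
bounded from above by the real constant $\gamma$ ($\gamma < \infty$)
for all $t \ge 0$.


Definition:

Let

$$\rho = [\tfrac{1}{2}(\beta - \alpha)h + \gamma h] \max_{0 \le \omega \le 2\pi}
  \frac{\Big| \sum_{k=-1}^{p} b_k e^{-i(k+1)\omega} \Big|}
       {\Big| 1 - \sum_{k=0}^{p} a_k e^{-i(k+1)\omega} + \tfrac{1}{2}(\alpha + \beta)h \sum_{k=-1}^{p} b_k e^{-i(k+1)\omega} \Big|}.$$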

Theorem 1: If

(i) the assumptions of Section I concerning $f(x, t)$ and (2) are
satisfied,
(ii) $1 + \tfrac{1}{2}(\alpha + \beta)h b_{-1} \ne 0$,
(iii) $F(z) \ne 0$ for all $|z| \ge 1$,

then

$$\langle e \rangle \ge (1 + \rho)^{-1} \min_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-1} \langle \varphi \rangle.$$

Theorem 2: If (i), (ii), and (iii) of Theorem 1 are satisfied, and if
$\rho < 1$, then

(i) $\langle e \rangle \le (1 - \rho)^{-1} \max_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-1} \langle \varphi \rangle$,

and

(ii) $\sup_{n \ge 0} \|\varphi_{n-1}\| < \infty$ implies
$\sup_{n \ge 0} \|e_n\| < \infty$.

Corollary to Theorem 1: If (i) of Theorem 1 is satisfied and there
exists at least one real constant $k_1$ such that

$$1 - \sum_{k=0}^{p} a_k z^{-(k+1)} + k_1 h \sum_{k=-1}^{p} b_k z^{-(k+1)} \ne 0$$

for all $|z| \ge 1$, then there exists a positive constant $k_2$, which
depends only on $a_0, a_1, \ldots, a_p$, $b_{-1}, b_0, \ldots, b_p$,
$\alpha$, $\beta$, and $\gamma$ such that

$$\langle e \rangle \ge k_2 \langle \varphi \rangle.$$

Theorems 1 and 2 are proved* in the following section. The proof
of the corollary is very simple.

Since

$$1 - \sum_{k=0}^{p} a_k z^{-(k+1)} + k_1 h \sum_{k=-1}^{p} b_k z^{-(k+1)} \ne 0$$

for all $|z| \ge 1$, there exists a $k'_1$ such that

$$1 - \sum_{k=0}^{p} a_k z^{-(k+1)} + k'_1 h \sum_{k=-1}^{p} b_k z^{-(k+1)} \ne 0$$

for all $|z| \ge 1$, and

$$1 + k'_1 h b_{-1} \ne 0.$$

Choose $\alpha'$ and $\beta'$ such that $\alpha' \le \alpha$,
$\beta' \ge \beta$, and $\tfrac{1}{2}(\alpha' + \beta') = k'_1$. If we
replace $\alpha$ and $\beta$ with $\alpha'$ and $\beta'$, respectively,
we see that Theorem 1 applies.

* Our proofs actually yield sharper, but less explicit, bounds on
$\langle e \rangle$ than those of Theorems 1 and 2. See (31) and (37).
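
The bounds of Theorems 1 and 2 can be illustrated on the simplest scalar
case. The sketch below is an assumption-laden example, not taken from
the paper: it applies Euler's method to $f(x, t) = \lambda x$, for which
$\alpha = \beta = \lambda$, $\gamma = 0$ and hence $\rho = 0$, uses an
exact starting value so that $\varphi_{-1} = 0$, and takes
$\varphi_n = T_n$ by ignoring roundoff. The function names are
hypothetical. With $c = \lambda h$, Euler's formula gives
$F(e^{i\omega}) = 1 - (1 - c)e^{-i\omega}$, so that
$\min|F|^{-1} = [1 + |1 - c|]^{-1}$ and $\max|F|^{-1} = [1 - |1 - c|]^{-1}$.

```python
import math


def rms(values):
    """Root-mean-squared value of a list, as in the definitions of <e> and <phi>."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def euler_error_bounds(lam, h, M):
    """Return (lower bound, <e>, upper bound) for x' + lam*x = 0, x(0) = 1."""
    c = lam * h
    if not 0.0 < c < 2.0:
        raise ValueError("requires 0 < lam * h < 2 so that F(z) != 0 for |z| >= 1")
    x = [math.exp(-lam * n * h) for n in range(M + 1)]   # true samples x(nh)
    y = [1.0]                                            # exact starting value
    for n in range(M):
        y.append(y[n] - h * lam * y[n])                  # Euler step
    e = [x[n] - y[n] for n in range(M + 1)]
    # phi_{-1} = 0 (exact start); phi_n = T_n = x_{n+1} - x_n - h*x'_n for n >= 0.
    phi = [0.0] + [x[n + 1] - (1.0 - c) * x[n] for n in range(M)]
    lower = rms(phi) / (1.0 + abs(1.0 - c))              # Theorem 1 with rho = 0
    upper = rms(phi) / (1.0 - abs(1.0 - c))              # Theorem 2 with rho = 0
    return lower, rms(e), upper


if __name__ == "__main__":
    print(euler_error_bounds(1.0, 0.1, 50))
    print(euler_error_bounds(1.0, 1.5, 20))
```

In both runs the measured root-mean-squared error should lie between the
two bounds; the gap between them widens as $\lambda h$ approaches 0 or 2.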
III. PROOFS


Proof of Theorem 1:


We have

$$y_{n+1} = \sum_{k=0}^{p} a_k y_{n-k} + h \sum_{k=-1}^{p} b_k y'_{n-k} + R_n, \qquad n \ge p$$

and

$$x_{n+1} = \sum_{k=0}^{p} a_k x_{n-k} + h \sum_{k=-1}^{p} b_k x'_{n-k} + T_n, \qquad n \ge p.$$

Thus,

$$e_{n+1} = \sum_{k=0}^{p} a_k e_{n-k}
  - h \sum_{k=-1}^{p} b_k \{ f[x_{n-k}, (n-k)h] - f[y_{n-k}, (n-k)h] \} + \varphi_n, \qquad n \ge p$$

and, with $\varphi_n$ defined for $n = -1, 0, \ldots, (p - 1)$ by (6),

$$e_n = \sum_{k=0}^{p} a_k e_{n-1-k}
  - h \sum_{k=-1}^{p} b_k \{ f[x_{n-1-k}, (n-1-k)h] - f[y_{n-1-k}, (n-1-k)h] \} + \varphi_{n-1} \tag{16}$$

for $n \ge 0$.

As a matter of convenience we define $a_{-1} \triangleq 0$.


Lemma 1: There exist real sequences $\{w_n\}_{n=0}^{\infty}$ and
$\{v_n\}_{n=0}^{\infty}$ both belonging to $l_1$ (i.e., with the property
that $\sum_{k=0}^{\infty} |w_k| < \infty$ and
$\sum_{k=0}^{\infty} |v_k| < \infty$) such that

$$W(z) \triangleq \sum_{n=0}^{\infty} w_n z^{-n}
  = \frac{-h \sum_{k=-1}^{p} b_k z^{-(k+1)}}{1 - \sum_{k=-1}^{p} [a_k - \tfrac{1}{2}(\alpha + \beta)h b_k] z^{-(k+1)}} \tag{17}$$

$$V(z) \triangleq \sum_{n=0}^{\infty} v_n z^{-n}
  = \frac{1}{1 - \sum_{k=-1}^{p} [a_k - \tfrac{1}{2}(\alpha + \beta)h b_k] z^{-(k+1)}} \tag{18}$$

for all $|z| \ge 1$.


The proof follows at once from the standard theory of z-transforms,
in view of assumptions (ii) and (iii) of Theorem 1. The details are
omitted.
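Lemma 2: Let
$\delta_n = f(x_n, nh) - f(y_n, nh) - \tfrac{1}{2}(\alpha + \beta)(x_n - y_n)$
for all $n \ge 0$, and let $\{w_n\}$ and $\{v_n\}$ be as described in
Lemma 1. Then

$$e_n = \sum_{k=0}^{n} w_{n-k} \delta_k + \sum_{k=0}^{n} v_{n-k} \varphi_{k-1} \tag{19}$$

for $n = 0, 1, \ldots, M$.

Proof of Lemma 2:

From (16) and our definition of $\delta_n$:

$$e_n = \sum_{k=-1}^{p} [a_k - \tfrac{1}{2}(\alpha + \beta)h b_k] e_{n-k-1}
  - h \sum_{k=-1}^{p} b_k \delta_{n-k-1} + \varphi_{n-1}, \qquad n \ge 0. \tag{20}$$

We multiply both sides of (20) by $e^{-in\omega}$ and then sum from
$n = 0$ to $n = M$ to obtain

$$\sum_{n=0}^{M} e^{-in\omega} e_n
  = \sum_{k=-1}^{p} [a_k - \tfrac{1}{2}(\alpha + \beta)h b_k] \sum_{n=0}^{M} e^{-in\omega} e_{n-k-1}
  - h \sum_{k=-1}^{p} b_k \sum_{n=0}^{M} e^{-in\omega} \delta_{n-k-1}
  + \sum_{n=0}^{M} e^{-in\omega} \varphi_{n-1} \tag{21}$$

for all $\omega \in [0, 2\pi]$. Using $e_n = \delta_n = 0$ for $n < 0$,
we have

$$\sum_{n=0}^{M} e^{-in\omega} e_{n-k-1}
  = e^{-i(k+1)\omega} \sum_{n=0}^{M} e^{-in\omega} e_n
  - e^{-i(k+1)\omega} \sum_{n=M-k}^{M} e^{-in\omega} e_n \tag{22}$$

and

$$\sum_{n=0}^{M} e^{-in\omega} \delta_{n-k-1}
  = e^{-i(k+1)\omega} \sum_{n=0}^{M} e^{-in\omega} \delta_n
  - e^{-i(k+1)\omega} \sum_{n=M-k}^{M} e^{-in\omega} \delta_n. \tag{23}$$

Thus, from (21), (22), and (23)

$$\sum_{n=0}^{M} e^{-in\omega} e_n
  = W(e^{i\omega}) \sum_{n=0}^{M} e^{-in\omega} \delta_n
  + V(e^{i\omega}) \sum_{n=0}^{M} e^{-in\omega} \varphi_{n-1}
  + V(e^{i\omega}) \Big\{ h \sum_{k=-1}^{p} b_k e^{-i(k+1)\omega} \sum_{n=M-k}^{M} e^{-in\omega} \delta_n
  - \sum_{k=-1}^{p} [a_k - \tfrac{1}{2}(\alpha + \beta)h b_k] e^{-i(k+1)\omega} \sum_{n=M-k}^{M} e^{-in\omega} e_n \Big\} \tag{24}$$

for all $\omega \in [0, 2\pi]$.

The expression within the braces in (24) can be written as

$$\sum_{n=0}^{\infty} s_n e^{-in\omega}$$

with $s_n = 0$ for $n = 0, 1, \ldots, M$ and for $n > (1 + M + p)$.
Since $\{v_n\} \in l_1$, we have

$$V(e^{i\omega}) \sum_{n=0}^{\infty} s_n e^{-in\omega}
  = \sum_{n=0}^{\infty} e^{-in\omega} \sum_{k=0}^{n} v_{n-k} s_k.$$

Similarly,

$$V(e^{i\omega}) \sum_{n=0}^{M} e^{-in\omega} \varphi_{n-1}
  = \sum_{n=0}^{\infty} e^{-in\omega} \sum_{k=0}^{n} v_{n-k} (\varphi_{k-1})_M$$

in which

$$(\varphi_{k-1})_M = \varphi_{k-1}, \quad k \le M; \qquad (\varphi_{k-1})_M = 0, \quad k > M$$

and finally, since $\{w_k\} \in l_1$,

$$W(e^{i\omega}) \sum_{n=0}^{M} e^{-in\omega} \delta_n
  = \sum_{n=0}^{\infty} e^{-in\omega} \sum_{k=0}^{n} w_{n-k} (\delta_k)_M$$

where

$$(\delta_k)_M = \delta_k, \quad k \le M; \qquad (\delta_k)_M = 0, \quad k > M.$$

Thus,

$$\sum_{n=0}^{M} e^{-in\omega} e_n
  = \sum_{n=0}^{\infty} e^{-in\omega} \sum_{k=0}^{n} w_{n-k} (\delta_k)_M
  + \sum_{n=0}^{\infty} e^{-in\omega} \sum_{k=0}^{n} v_{n-k} (\varphi_{k-1})_M
  + \sum_{n=0}^{\infty} e^{-in\omega} \sum_{k=0}^{n} v_{n-k} s_k$$

for all $\omega \in [0, 2\pi]$. Since

$$\sum_{k=0}^{n} v_{n-k} s_k = 0$$

for $n = 0, 1, \ldots, M$ we have*

$$e_n = \sum_{k=0}^{n} w_{n-k} \delta_k + \sum_{k=0}^{n} v_{n-k} \varphi_{k-1} \tag{25}$$

for $n = 0, 1, \ldots, M$. This completes the proof of Lemma 2.
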
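Lemma 3: If (19) holds for $n = 0, 1, \ldots, M$, then

$$\Big( \sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} v_{n-k} \varphi_{k-1} \Big\|^2 \Big)^{1/2}
  \le \Big( \sum_{n=0}^{M} \|e_n\|^2 \Big)^{1/2}
  + \Big( \sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} w_{n-k} \delta_k \Big\|^2 \Big)^{1/2} \tag{26}$$

and

$$\Big( \sum_{n=0}^{M} \|e_n\|^2 \Big)^{1/2}
  \le \Big( \sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} w_{n-k} \delta_k \Big\|^2 \Big)^{1/2}
  + \Big( \sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} v_{n-k} \varphi_{k-1} \Big\|^2 \Big)^{1/2}. \tag{27}$$

Proof of Lemma 3:

Inequality (26) or (27) follows from (19) by two applications of
Minkowski’s inequality. We leave the details to the reader.
Inequality (27) is used only in the proof of Theorem 2.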


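Lemma 4:

$$\sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} w_{n-k} \delta_k \Big\|^2
  \le \max_{0 \le \omega \le 2\pi} |W(e^{i\omega})|^2 \sum_{n=0}^{M} \|\delta_n\|^2.$$

Proof of Lemma 4:

By Parseval’s identity,

$$\sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} w_{n-k} \delta_k \Big\|^2
  = \frac{1}{2\pi} \int_0^{2\pi} \Big\| \sum_{n=0}^{M} e^{-i\omega n} \sum_{k=0}^{n} w_{n-k} \delta_k \Big\|^2 d\omega
  = \frac{1}{2\pi} \int_0^{2\pi} \Big\| \sum_{n=0}^{M} e^{-i\omega n} \sum_{k=0}^{n} w_{n-k} (\delta_k)_M \Big\|^2 d\omega$$

in which

$$(\delta_k)_M = \delta_k, \quad k \le M; \qquad (\delta_k)_M = 0, \quad k > M.$$

Therefore,

$$\sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} w_{n-k} \delta_k \Big\|^2
  \le \frac{1}{2\pi} \int_0^{2\pi} \Big\| \sum_{n=0}^{\infty} e^{-i\omega n} \sum_{k=0}^{n} w_{n-k} (\delta_k)_M \Big\|^2 d\omega.$$

But since $\{w_k\} \in l_1$,

$$\sum_{n=0}^{\infty} e^{-i\omega n} \sum_{k=0}^{n} w_{n-k} (\delta_k)_M
  = \sum_{n=0}^{\infty} w_n e^{-i\omega n} \sum_{k=0}^{M} e^{-i\omega k} \delta_k
  = W(e^{i\omega}) \sum_{k=0}^{M} e^{-i\omega k} \delta_k.$$

Thus,

$$\sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} w_{n-k} \delta_k \Big\|^2
  \le \frac{1}{2\pi} \int_0^{2\pi} \Big\| W(e^{i\omega}) \sum_{k=0}^{M} e^{-i\omega k} \delta_k \Big\|^2 d\omega
  \le \max_{0 \le \omega \le 2\pi} |W(e^{i\omega})|^2 \frac{1}{2\pi} \int_0^{2\pi} \Big\| \sum_{k=0}^{M} e^{-i\omega k} \delta_k \Big\|^2 d\omega
  = \max_{0 \le \omega \le 2\pi} |W(e^{i\omega})|^2 \sum_{k=0}^{M} \|\delta_k\|^2,$$

which proves Lemma 4.

* We could have obtained (25) from (20) directly using standard
z-transform theory, if we had introduced further assumptions which
guarantee that the sequences $\{y_n\}$, $\{x(nh)\}$, and
$\{\varphi_n\}$ are transformable. However, this would have complicated
the statement of our results and would have weakened them in a
nontrivial manner.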


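Lemma 5:

$$\Big( \sum_{n=0}^{M} \|\delta_n\|^2 \Big)^{1/2}
  \le [\tfrac{1}{2}(\beta - \alpha) + \gamma] \Big( \sum_{n=0}^{M} \|e_n\|^2 \Big)^{1/2}.$$

Proof of Lemma 5:

We shall prove that

$$\|\delta_n\| \le [\tfrac{1}{2}(\beta - \alpha) + \gamma] \|e_n\|$$

for $n = 0, 1, \ldots, M$.

By definition

$$\|\delta_n\| = \| f(x_n, nh) - f(y_n, nh) - \tfrac{1}{2}(\alpha + \beta)(x_n - y_n) \|.$$

Let $g(a) = f[a x_n + (1 - a) y_n, nh]$ for $a \in [0, 1]$. Then

$$\frac{dg}{da} = f'[a x_n + (1 - a) y_n, nh](x_n - y_n)$$

in which $f'[a x_n + (1 - a) y_n, nh]$ denotes the Jacobian matrix of
$f(x, t)$, evaluated at $x = a x_n + (1 - a) y_n$, $t = nh$. Now, since
$g(1) - g(0) = f(x_n, nh) - f(y_n, nh)$, we have

$$\int_0^1 \frac{dg}{da}\, da = f(x_n, nh) - f(y_n, nh).$$

Therefore,

$$\|\delta_n\| = \Big\| \int_0^1 \{ f'[a x_n + (1 - a) y_n, nh] - \tfrac{1}{2}(\alpha + \beta) 1_N \}\, da\, (x_n - y_n) \Big\|$$

in which $1_N$ denotes the identity matrix of order $N$.

For $H$ an arbitrary $N \times N$ matrix, let
$\|H\| \triangleq$ (largest eigenvalue of $H^*H)^{1/2}$, in which $H^*$
denotes the complex-conjugate transpose of $H$ (i.e., let $\|H\|$
denote the “spectral norm” of $H$). Then

$$\|\delta_n\| \le \Big\| \int_0^1 \{ f'[a x_n + (1 - a) y_n, nh] - \tfrac{1}{2}(\alpha + \beta) 1_N \}\, da \Big\| \cdot \|x_n - y_n\|
  \le \int_0^1 \| f'[a x_n + (1 - a) y_n, nh] - \tfrac{1}{2}(\alpha + \beta) 1_N \|\, da\ \|e_n\|. \tag{28}$$

With $\{f'\}_S$ and $\{f'\}_A$, respectively, the symmetric and
antisymmetric parts of $f'$, we have

$$\| f' - \tfrac{1}{2}(\alpha + \beta) 1_N \| \le \| \{f'\}_S - \tfrac{1}{2}(\alpha + \beta) 1_N \| + \| \{f'\}_A \|.$$

For each $a \in [0, 1]$, there exists’ an orthogonal matrix $T_1$ such
that $T_1 \{f'\}_S T_1^{-1} \triangleq D = \mathrm{diag}(\xi_1, \xi_2, \ldots, \xi_N)$
in which, since $\xi_j$ is an eigenvalue of $\{f'\}_S$,

$$\alpha \le \xi_j \le \beta$$

for $j = 1, 2, \ldots, N$. Using $\|T_1\| = \|T_1^{-1}\| = 1$,

$$\| \{f'\}_S - \tfrac{1}{2}(\alpha + \beta) 1_N \|
  = \| T_1^{-1} D T_1 - \tfrac{1}{2}(\alpha + \beta) T_1^{-1} T_1 \|
  \le \| T_1^{-1} D - \tfrac{1}{2}(\alpha + \beta) T_1^{-1} \| \cdot \|T_1\|
  \le \|T_1^{-1}\| \cdot \| D - \tfrac{1}{2}(\alpha + \beta) 1_N \| \cdot \|T_1\|
  = \max_j |\xi_j - \tfrac{1}{2}(\alpha + \beta)|
  \le \tfrac{1}{2}(\beta - \alpha). \tag{29}$$

Thus,
$\| f' - \tfrac{1}{2}(\alpha + \beta) 1_N \| \le \tfrac{1}{2}(\beta - \alpha) + \| \{f'\}_A \|$.

Consider $\| \{f'\}_A \|$. For each $a \in [0, 1]$ there exists’ an
orthogonal matrix $T_2$ such that
$T_2 \{f'\}_A T_2^{-1} \triangleq S$ is a direct sum of $2 \times 2$
block matrices of the form

$$B_j = \begin{bmatrix} 0 & b_j \\ -b_j & 0 \end{bmatrix}$$

and, if $N$ is odd, a “$1 \times 1$ matrix” containing the zero
element.

Clearly,

$$B_j^* B_j = \begin{bmatrix} b_j^2 & 0 \\ 0 & b_j^2 \end{bmatrix},$$

$S^*S$ is a diagonal matrix, and its largest element is not greater
than $\gamma^2$. That is,

$$\| \{f'\}_A \| = \| T_2^{-1} S T_2 \| \le \|S\| \le \gamma,$$

and consequently

$$\| f' - \tfrac{1}{2}(\alpha + \beta) 1_N \| \le \tfrac{1}{2}(\beta - \alpha) + \gamma \tag{30}$$

for all $a \in [0, 1]$. Finally, from (28) and (30)

$$\|\delta_n\| \le [\tfrac{1}{2}(\beta - \alpha) + \gamma] \|e_n\|.$$

At this point we have proved that with $\rho$ as defined in Section 2.3

$$\Big( \sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} v_{n-k} \varphi_{k-1} \Big\|^2 \Big)^{1/2}
  \le (1 + \rho) \Big( \sum_{n=0}^{M} \|e_n\|^2 \Big)^{1/2}. \tag{31}$$
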
n=0 
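We now need the following result.

Lemma 6:

$$\sum_{n=0}^{M} \Big\| \sum_{k=0}^{n} v_{n-k} \varphi_{k-1} \Big\|^2
  \ge \min_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-2} \sum_{n=0}^{M} \|\varphi_{n-1}\|^2.$$

Proof of Lemma 6:

Consider
$S \triangleq \sum_{n=0}^{M} \| \sum_{k=0}^{n} v_{n-k} c_k \|^2$, with
the $c_k$’s N-vectors. Choose $c_{M+1}, c_{M+2}, \ldots$ so that

$$\sum_{k=0}^{n} v_{n-k} c_k = 0, \qquad n \ge (M + 1).$$

This is possible since $v_0 \ne 0$
[$v_0 = \lim_{z \to \infty} V(z)$]. Then

$$S = \sum_{n=0}^{\infty} \Big\| \sum_{k=0}^{n} v_{n-k} c_k \Big\|^2
  = \frac{1}{2\pi} \int_0^{2\pi} \Big\| \sum_{n=0}^{\infty} e^{-i\omega n} \sum_{k=0}^{n} v_{n-k} c_k \Big\|^2 d\omega. \tag{32}$$

Under the assumption that

$$\sum_{k=0}^{\infty} \|c_k\|^2 < \infty,$$

we can write

$$\sum_{n=0}^{\infty} e^{-i\omega n} \sum_{k=0}^{n} v_{n-k} c_k
  = \sum_{n=0}^{\infty} e^{-i\omega n} v_n \sum_{k=0}^{\infty} e^{-i\omega k} c_k \tag{33}$$

in which the last sum over $k$ is interpreted as the usual limit in the
mean. From (32) and (33)

$$S = \frac{1}{2\pi} \int_0^{2\pi} \Big\| \sum_{n=0}^{\infty} v_n e^{-i\omega n} \sum_{k=0}^{\infty} e^{-i\omega k} c_k \Big\|^2 d\omega
  \ge \min_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-2} \frac{1}{2\pi} \int_0^{2\pi} \Big\| \sum_{k=0}^{\infty} e^{-i\omega k} c_k \Big\|^2 d\omega
  = \min_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-2} \sum_{k=0}^{\infty} \|c_k\|^2
  \ge \min_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^{-2} \sum_{k=0}^{M} \|c_k\|^2. \tag{34}$$

We now prove that

$$\sum_{k=0}^{\infty} \|c_k\|^2 < \infty.$$

Let

$$q_n \triangleq \sum_{k=0}^{n} v_{n-k} c_k, \qquad n \ge 0.$$

Of course $q_n = 0$ for $n \ge (M + 1)$.
Let $K$ be an arbitrary positive integer. Then

$$\sum_{n=0}^{K} e^{-i\omega n} \sum_{k=0}^{n} v_{n-k} c_k = \sum_{n=0}^{K} e^{-i\omega n} q_n.$$

With

$$(c_k)_K = c_k, \quad k \le K; \qquad (c_k)_K = 0, \quad k > K$$

we obtain

$$\sum_{n=0}^{\infty} e^{-i\omega n} \sum_{k=0}^{n} v_{n-k} (c_k)_K
  - \sum_{n=K+1}^{\infty} e^{-i\omega n} \sum_{k=0}^{n} v_{n-k} (c_k)_K
  = \sum_{n=0}^{K} e^{-i\omega n} q_n.$$

Therefore,

$$\sum_{k=0}^{K} e^{-i\omega k} c_k
  = \Big[ 1 - \sum_{k=-1}^{p} [a_k - \tfrac{1}{2}(\alpha + \beta)h b_k] e^{-i\omega(k+1)} \Big]
    \sum_{n=K+1}^{\infty} e^{-i\omega n} \sum_{k=0}^{n} v_{n-k} (c_k)_K
  + F(e^{i\omega}) \sum_{n=0}^{K} e^{-i\omega n} q_n. \tag{35}$$

The first term on the right side of (35) can be written as

$$\sum_{n=K+1}^{\infty} e^{-i\omega n} d_n.$$

Thus,

$$\sum_{k=0}^{K} \|c_k\|^2 \le \sum_{k=0}^{K} \|c_k\|^2 + \sum_{n=K+1}^{\infty} \|d_n\|^2
  = \frac{1}{2\pi} \int_0^{2\pi} \Big\| F(e^{i\omega}) \sum_{n=0}^{K} e^{-i\omega n} q_n \Big\|^2 d\omega,$$

from which we obtain

$$\sum_{k=0}^{K} \|c_k\|^2
  \le \max_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^2 \sum_{n=0}^{K} \|q_n\|^2
  \le \max_{0 \le \omega \le 2\pi} |F(e^{i\omega})|^2 \sum_{n=0}^{M} \|q_n\|^2. \tag{36}$$

Since (36) holds for all $K > 0$, we have completed the proof of Lemma
6.*

Inequality (31) and Lemma 6 prove Theorem 1.

* Alternatively, we can show using (35), that only a finite number of
$c_k$ are nonzero.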


Proof of Theorem 2: 
By Lemma 3 [inequality (27)], Lemma 4, and Lemma 5 of the 

preceding proof, we have 
M 


(S tle lt) oS te it) +( 


n=0 














n 2\4 
Ze Vn—-kPk-1 ) : (37) 
=0 


Furthermore, by essentially the same argument used to prove Lemma 
4, we find that 


M n 
> > Un-KEk-1 
=0 


n=0 
Therefore, with p < 1, 














2\4 ; M 3 
) < max | Fe’*) |’ ( [] ee I") 


Osws2r 


(S thet) $@— 97 max [re P(E Hee I). 


We now prove that sup || ¢,-1 || < © implies sup || e, || < «©. Assume 
n20 nz0 


* Alternatively, we can show using (35), that only a finite number of cx are 
nonzero. 
that sup || ¢n-1 || < ©. Let
nz=0 


n 
Un, = ba VnerEk-1 - 
k=0 


Then we have 


ea See ae ae n=0,1,2,---,M (38) 
5x = fe kh) — flyx , kh) — 3@ + BY — y) (39) 


in which, since {v,} € 1, , sup || uw, || < 0. 
n20 


—Can 


There exist positive constants c, and c, such that |w,| S ce 
for all n = 0. By continuity, since p < 1, there exists a a € (0, c2) such 
that 


po = max | We) | [2 —«) +7] <1 


OSwS527 


in which of course 


We'?~" = 3 wee 
n| . 


n=0 


From (38) and (39):
é£= > t_.4+%, n=0,1,2,-:-,M 
k=0 


6, = f(&: ’ kh) a CP ’ kh) = (a + B) (&x =? Gx) 
where é, = €'"€,, On = €''W, , tin = C°'Un, Op = CMO, , Be = CM, He = 
ey, , and f(q, kh) = e“f(e"“q, kh) for all N-vectors q. 
The Jacobian matrix f’ of f is related to the Jacobian matrix of f by 
f'(q, kh) = f°" q, kh) 
from which we see that f’ satisfies the assumption concerning f’ relative 
to the numbers a, 8, and y. Therefore, by the proof of Theorem 1, 


(Steir) sa- (Stair) 


=0 


1 
2 


for all M 2 (p+ 1). 
Now 


? 


M 3 M 3 M 3 
(5 tha IP) = (5 tem IP) S sup [me 1 (Se) 
n=0 n=0 nz0 n=0 
— cw 1’
su Un Ne ee th 
pe | ! e" , 1 


< 
o(AL+1) 
< su 2 || 
me fe I| | (e*” — 1) 
and so, 
(2 ilalr) s os | 0) 
(1 — p,)” >> su di. 
[| &. || pe) (ee yi UP || 
for all M = (p + 1). 
From (38): 
Hear || s + sup || u, |]. (41) 
n=O 














We shall now use (40) to bound the first term on the right side of (41). 
Using the Schwarz inequality, 


M 
> Warn Ox 
n=0 


MM 


Dy Bart bs 
oS jap) ( IF) 


et AM 























(42) 


‘ 


By the proof of Lemma 5, 
I] & || 


IA 


[3(6 — a) + y] |] & Il, 
which leads to 


M 
SZ Wu—-r Oz 
k=0 








e’ .. Oo 4 
= @ pi bb - 4) + y](1 — p,)7* sup || w, || (> |, [) 
Since o ¢ (0, c2) [see the paragraph below (39)], |@,|* ¢ 1, , and therefore, 


3 Warn Ox 
k=0 


Finally, from (41) and (43), we have sup || e, || < 2%, which completes 
n20 


sup <= (48) 


M>(p+1) 














the proof of Theorem 2.


IV. A CONDITION FOR THE SOLVABILITY OF (2) FOR $y_{n+1}$


The problem of solving (2) for $y_{n+1}$ is that of solving the equation

$$y + h b_{-1} f(y, t) = g \tag{44}$$

for $y$, with $g$ a given $N$-vector. We write (44) as

$$Qy = g \tag{45}$$

in which the operator $Q$ is defined by the condition that
$Qv = v + h b_{-1} f(v, t)$ for all real $N$-vectors $v$.

We prove below that (with $(\cdot,\cdot)$ denoting the usual inner product of
real $N$-vectors):

$$(Qy_a - Qy_b,\; y_a - y_b) \ge k_1 \|y_a - y_b\|^2 \tag{46}$$

$$\|Qy_a - Qy_b\| \le k_2 \|y_a - y_b\| \tag{47}$$

for every pair of real $N$-vectors $y_a$ and $y_b$, in which
$k_1 = (1 + h b_{-1}\alpha)$ if $b_{-1} \ge 0$, $k_1 = (1 + h b_{-1}\beta)$ if
$b_{-1} \le 0$, and $k_2 = \{1 + h\,|b_{-1}|\,[\max(|\alpha|, |\beta|) + \gamma]\}$.
Since $k_2 < \infty$, according to a special case of Theorem I of Ref. 8,
if $k_1 > 0$ (e.g., if $b_{-1} \ge 0$ and $1 + h b_{-1}\alpha > 0$),
then (45) possesses exactly one solution which can be determined by an
iteration procedure that is only slightly more complicated than the
usual procedure (Ref. 12), which is valid only under much stronger
conditions on $h$.
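To derive (46), let

$$q(\eta) = \eta y_a + (1 - \eta) y_b + h b_{-1} f[\eta y_a + (1 - \eta) y_b,\; t]$$

for $\eta \in [0, 1]$. Then

$$q'(\eta) = (y_a - y_b) + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t]\,(y_a - y_b),$$

and so

$$Qy_a - Qy_b = \int_0^1 q'(\eta)\, d\eta
 = \int_0^1 \{1_N + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t]\}(y_a - y_b)\, d\eta. \tag{48}$$

Thus,

$$\begin{aligned}
(Qy_a - Qy_b,\; y_a - y_b)
 &= \Big( \int_0^1 \{1_N + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t]\}\, d\eta\,(y_a - y_b),\; y_a - y_b \Big) \\
 &= \int_0^1 \big( \{1_N + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t]\}(y_a - y_b),\; (y_a - y_b) \big)\, d\eta \\
 &= \|y_a - y_b\|^2 + h b_{-1} \int_0^1 \big( f'_s[\eta y_a + (1 - \eta) y_b,\; t](y_a - y_b),\; (y_a - y_b) \big)\, d\eta,
\end{aligned}$$

in which $f'_s$ denotes the symmetric part of $f'$. Thus, since the eigenvalues
of $f'_s$ are bounded from below by $\alpha$, and from above by $\beta$:

$$\begin{aligned}
(Qy_a - Qy_b,\; y_a - y_b) &\ge (1 + h b_{-1}\alpha)\,\|y_a - y_b\|^2, \qquad b_{-1} \ge 0 \\
 &\ge (1 + h b_{-1}\beta)\,\|y_a - y_b\|^2, \qquad b_{-1} \le 0.
\end{aligned}$$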

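Consider now the derivation of (47). By (48),

$$\begin{aligned}
\|Qy_a - Qy_b\|
 &= \Big\| \int_0^1 \{1_N + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t]\}\, d\eta\,(y_a - y_b) \Big\| \\
 &\le \Big\| \int_0^1 \{1_N + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t]\}\, d\eta \Big\| \cdot \|y_a - y_b\| \\
 &\le \int_0^1 \big\| 1_N + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t] \big\|\, d\eta \; \|y_a - y_b\|.
\end{aligned}$$

But, with $f'_a$ the antisymmetric part of $f'$,

$$\begin{aligned}
\big\| 1_N + h b_{-1} f'[\eta y_a + (1 - \eta) y_b,\; t] \big\|
 &\le 1 + h\,|b_{-1}|\, \big\| f'[\eta y_a + (1 - \eta) y_b,\; t] \big\| \\
 &\le 1 + h\,|b_{-1}|\, \|f'_s\| + h\,|b_{-1}|\, \|f'_a\| \\
 &\le 1 + h\,|b_{-1}| \max(|\alpha|, |\beta|) + h\,|b_{-1}|\,\gamma.
\end{aligned}$$

Therefore,

$$\|Qy_a - Qy_b\| \le \{1 + h\,|b_{-1}|\,[\max(|\alpha|, |\beta|) + \gamma]\}\, \|y_a - y_b\|,$$

which is equivalent to (47).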
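
The bounds (46) and (47) can be exercised numerically. The sketch below is not the iteration procedure of Ref. 8, which is not reproduced in this section. It assumes the standard relaxed iteration y <- y - c(Qy - g) with c = k1/k2^2, which contracts with factor (1 - k1^2/k2^2)^(1/2) whenever (46) and (47) hold with k1 > 0. The test function f, the matrix inside it, and the values of h and b_{-1} are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical test problem (not from the paper):
#   f(y, t) = A y + 0.5 sin(y),  A = [[2, 1], [-1, 3]].
# The symmetric part of the Jacobian is diag(2 + 0.5 cos y0, 3 + 0.5 cos y1),
# so its eigenvalues lie in [1.5, 3.5]; the antisymmetric part has norm 1.
ALPHA, BETA, GAMMA = 1.5, 3.5, 1.0
H, B_M1 = 0.1, 0.5  # assumed step size h and coefficient b_{-1}


def f(y, t):
    return [2.0 * y[0] + 1.0 * y[1] + 0.5 * math.sin(y[0]),
            -1.0 * y[0] + 3.0 * y[1] + 0.5 * math.sin(y[1])]


def Q(y, t):
    """Operator of (45): Qy = y + h b_{-1} f(y, t)."""
    fy = f(y, t)
    return [yi + H * B_M1 * fi for yi, fi in zip(y, fy)]


def solve(g, t, tol=1e-12, max_iter=500):
    """Relaxed iteration y <- y - c (Qy - g), c = k1 / k2**2 (an assumption)."""
    k1 = 1.0 + H * B_M1 * (ALPHA if B_M1 >= 0 else BETA)
    k2 = 1.0 + H * abs(B_M1) * (max(abs(ALPHA), abs(BETA)) + GAMMA)
    c = k1 / k2 ** 2
    y = list(g)  # start from the right-hand side
    for _ in range(max_iter):
        residual = [qi - gi for qi, gi in zip(Q(y, t), g)]
        if math.hypot(residual[0], residual[1]) < tol:
            break
        y = [yi - c * ri for yi, ri in zip(y, residual)]
    return y


y = solve([1.0, -2.0], 0.0)
print("solution:", y, "check Qy:", Q(y, 0.0))
```

With these values k1 = 1.075 and k2 = 1.225, so the guaranteed contraction factor is about 0.48 per step and the loop stops well before `max_iter`.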
Design Considerations for a Semipermanent


Optical Memory 


By F. M. SMITS and L. E. GALLAHER 


(Manuscript received April 7, 1967) 


The potential of high-speed optical memories using electro-optic or
acousto-optic light deflection for address selection is examined. It is shown
that for such memories the total memory capacity decreases as the third
power of the addressing rate and that capacities in excess of 10^8 bits are
feasible with a random access rate of 10^6 addresses/sec.

A specific semipermanent memory design is then described which uses a
laser light source, an acoustic xy light deflector and an array of 10^4 holograms
as information storage elements. Each storage element contains 10^4 bits
which appear as a pattern on a semiconductor read-out matrix when the
storage element is illuminated through the xy deflector. Accordingly, the
system has a total capacity of 10^8 bits with an access time of less than 10 μsec.


I. INTRODUCTION 


The increased computational capabilities of modern day computers 
are accompanied by a sharply increased need for a semipermanent 
memory to store their base programs. The capacity requirements for a 
given base program memory generally increase as fast or perhaps faster 
than the computational complexity of the computer increases. 

Along with the increased capacity, the next generations of memories 
must have increased speed capabilities since the speed of the program 
memory generally controls the overall computer processing speed. 
Ideally, a program memory should have a random access capability 
to each instruction or to each group of instructions if the instructions 
can be suitably grouped. 

If individual instructions are accessed in one memory cycle, the 
speed of the program memory has to match the rate at which the 
desired instructions have to be made available to the logic or processor 
frame. If, however, a group of instructions can be obtained in one access
operation in the program memory, then the number of groups which
must be accessed per unit time is reduced by the number of desired 
instructions contained within each group. This concept is usually 
referred to as paging; that is, each access operation within the memory 
selects a page, or set of instructions, rather than a single instruction. 
Operations within the memory output circuits then transfer the de- 
sired instruction from the accessed page, or pages, to the logic frame. 

The twofold requirements of high speed and large capacity for a
semipermanent memory suggest a review of the status and the
capabilities of optical memories, particularly in view of the progress made
in coherent optics within the past few years. The high speed
requirements restrict the considerations to systems with access times well
below 1 msec, which makes mechanical motion in the address selection
impractical. The flying spot store was the first fully-developed optical
memory with a short access time in the order of a few microseconds.? 
Recent publications in the field include studies of electro-optical sys- 
tems,?:34°.67 holographic systems,*® magneto-optical systems,®:1%11-12 
and ferro-electric optical systems. 

With a view towards high speed and high capacity a system using a 
laser light source and solid-state light deflectors appears particularly 
promising. This combination permits one to direct highly collimated 
and coherent light beams to a given address. Two suitable methods for 
deflecting light are available. They are the digital light deflector!+%® 
and the acousto-optic deflector.2 The high light intensity available 
from lasers, in conjunction with semiconductor detector arrays, makes 
it feasible to increase substantially the number of bits stored at each 
address as is required for page organization. 

In order to assess the potential usefulness of such a system, the 
achievable speed and memory capacity should be examined on as 
general a basis as possible. Such an assessment is presented in the 
following section of this paper. To illustrate the potential of optical 
semipermanent stores this is followed by a description of a specific 
system which is based entirely on present technology and which has a 
storage capacity of 10^8 bits, divided into pages of 10^4 bits each with a
page access time of less than 10 μsec.


II. STORAGE CAPACITY AND SPEED POTENTIAL 


The total number of bits that can be stored in an optical memory of 
the type discussed here has an upper limit that depends on the speed 
of the memory. This limit is set by physical limitations in the addressing
rate as a function of the total number of addresses and by the
number of bits at each address which for a given interrogation rate is 
limited by the light intensity. 

For both methods of light deflection the ultimate limit in the addressing
system can be expressed in terms of a number for the capacity
speed product CSP = N_a^(1/2) ν_a, where N_a is the total number of addresses
and ν_a is the rate (addresses/sec) of random addressing.* At low
addressing rates an upper limit on N_a is set by the diffraction limit.

Kurtz?* discussed the capacity speed product for the electro-optic 
deflector in the form described by Tabor.*® For a linear electro-optic 
material he obtains 


6 


100r«V 3] 1 + a= 8) 


where the symbols have the following meaning: 





CSP = Ny, = Pixs 


P_R  reactive drive power

ε    dielectric constant of the electro-optic material

V_r  reduced half-wave voltage, that is, the voltage which for an
     electrode spacing equal to the length produces half-wave phase
     retardation

λ    the wavelength of the light

S_m  the length of a module consisting of a polarization switch of
     length S and a Wollaston prism of length S_m − S.


In this formula, it is assumed that the smallest angular increment
between addresses is given by 5λ/d with d the width of the aperture.
This assumption allows for a λ/4 wavefront distortion, the aperturing
due to the "walkoff",† and a guardband of 1.5λ/d. It should
correspond to about −20-dB crosstalk.

Obviously the driver power should be as large as possible. The only 
inherent limit is given by the heating of the electro-optic material 
which is a consequence of the finite Q of the crystal. For most linear 
electro-optic materials, heating does not present a serious problem. 

The situation is more complicated for the quadratic electro-optic 
material KTN. Kurtz derived the capacity speed product for that 
case and particularly considered the limitations due to heating since 


* The treatment of the capacity speed product draws heavily on an unpublished 
summary of the subject by K. D. Bowers. 

† The light entering all but the first module has undergone at least some
deflection and thus, “walked off” the optical axis.
the material has to be operated very close to its Curie temperature so
that small changes in temperature will cause a large change in the 
electrical permittivity, hence in the polarization, hence in the electri- 
cally induced birefringence. 

For the evaluation of specific cases the following assumptions are 
made: 


driver power P_R = 10 W
wavelength of light λ = 0.633 μ
2s: 


The resulting capacity speed products are listed in Table I. For 
KDP the listed values are for the longitudinal electro-optic effect since 
in that case it is larger than the transverse electro-optic effect. The 
values for all other materials are for the transverse electro-optic effect. 
The value listed for KTN is taken from Kurtz, but adjusted for the 
higher wavelength. 

It can be seen that the highest capacity speed product achievable 
with presently known materials is on the order of 10^9 sec^-1.

The acousto-optic deflection of light has recently been reviewed by 
Gordon.? In this method, light is diffracted on the modulation of the 
refractive index produced by an acoustic wave. The deflection angle 
is changed by changing the sound frequency. The range of angular 
deflection is given by 


$$\Delta\theta = (\lambda / v_s)\,\Delta f_s,$$

where v_s is the sound velocity and Δf_s is the range of the acoustic
frequencies available.











TABLE I

                          V_r            κV_r^2           CSP
Material       κ = ε/ε0   (10^3 volt)    (10^8 volt^2)    (10^9 sec^-1)

Linear electro-optic effect
  KDP            20         8              13               0.07
  CuCl (18)       8         7.2             4               0.23
  LiNbO3 (19,20) 32         2.8             2.5             0.37
  LiTaO3         43         2.4             2.5             0.37
  ZnTe           10         2.9             0.84            1.1

Quadratic electro-optic effect
  KTN          1400         0.054           0.4             0.8
Since the incident light is apertured to the width d of the deflector,
the emerging beam will have an angular spread due to diffraction. In
order to have well-resolved output positions an angular separation
between neighboring addresses of 4λ/d is assumed, which allows for a
λ/4 wavefront distortion and for a guardband of 1.5λ/d. This separation
is somewhat less than was assumed in the electro-optic case where
an additional allowance had to be made for the "walkoff." The number
of resolvable addresses in one coordinate is thus obtained as

$$n = \Delta\theta / (4\lambda/d) = (\Delta f_s/4)\,(d/v_s).$$

The quantity d/v_s is the transit time for the acoustic energy across the
optical aperture, that is, the time needed to change the frequency in the
deflector, or in other words v_s/d = ν_a. If the X deflector is followed by
a similar Y deflector and the two are operated simultaneously, the
total number of resolvable addresses is

$$N_a = n^2 = (\Delta f_s/4)^2\,(d/v_s)^2 = (\Delta f_s/4)^2 / \nu_a^2.$$

Therefore, the capacity speed product is

$$\mathrm{CSP} = N_a^{1/2}\,\nu_a = \Delta f_s/4.$$

Considering the present state of the technology it is justified to assume
Δf_s = 400 MHz, which gives a capacity speed product of 10^8 sec^-1.

For both deflectors, the maximum number of addresses is limited by
the optics of the system in conjunction with the practical limit on the
size of the deflector element. This limit is estimated at 10^4 addresses.
In the case of the electro-optic deflector this corresponds to an aperture
of a few centimeters for the polarization switches, a size which will be
difficult to exceed for electro-optic materials of adequate optical
quality. On the other hand, in the acoustic case the generally required
cylindrical optics makes it difficult to exceed the above limit.

The resulting limitations in the number of addresses versus address- 
ing rate are depicted in Fig. 1 (a) and (b). 
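
The acousto-optic address limit can be sketched numerically. The function below is illustrative only: it takes the capacity speed product as Δf_s/4 with the 400 MHz bandwidth assumed in the text, uses N_a = (CSP/ν_a)^2 for an XY deflector, and caps the result at an optics limit read here as 10^4 addresses (the exponent in the scanned source is damaged, so that value is an assumption). The function and parameter names are hypothetical.

```python
def acousto_optic_addresses(rate_per_s, delta_f_hz=400e6, optics_limit=1e4):
    """Resolvable addresses for an XY acousto-optic deflector.

    rate_per_s   : random addressing rate nu_a (addresses/sec)
    delta_f_hz   : acoustic bandwidth; 400 MHz is the value assumed in the text
    optics_limit : practical ceiling on addresses (assumed to be 10^4)
    """
    csp = delta_f_hz / 4.0         # capacity speed product, sec^-1
    n_per_axis = csp / rate_per_s  # resolvable positions in one coordinate
    return min(n_per_axis ** 2, optics_limit)


for rate in (1e4, 1e5, 1e6, 1e7):
    n_addr = acousto_optic_addresses(rate)
    print(f"{rate:.0e} addresses/sec -> {n_addr:.0f} addresses")
```

Below the rate at which (CSP/ν_a)^2 reaches the optics limit the address count is flat; above it the count falls as the inverse square of the addressing rate.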

If one stores M bits at each address, the laser energy reaching this 
address has to be distributed over the M bits. In order to detect any 
one bit, a certain minimum amount of light energy (number of photons) 
is needed at the detector for any desired signal-to-noise ratio. For a 
given light power reaching a given address, this sets an upper limit to 
the number of bits that can be accommodated at each address. 

If an ideal photon detector were used, the noise limit would corre- 
spond to 1 photon per bit reaching a detector in the time interval which 
Fig. 1— Capacity speed limitations in an optical memory (vertical axis:
number of addresses or bits; horizontal axis: addressing rate, sec^-1).
(a) Number of addresses vs addressing rate for the electro-optic deflector.
(b) Number of addresses vs addressing rate for the acousto-optic deflector.
(c) Number of bits per address vs addressing rate. (d) Number of total bits
vs addressing rate for electro-optic deflector. (e) Number of total bits vs
addressing rate for acousto-optic deflector.


is consistent with the desired memory cycle time. With a multiplying
p-n junction, the noise limit is approximately 10^2 photons per bit and
with a non-multiplying junction, the corresponding limit is 10^4
photons." For an adequate signal-to-noise ratio, the number of photons
reaching the detector should be 10^2 times the number corresponding to
the noise limit.

For an array of photodetectors, non-multiplying junctions offer the 
greatest economy and flexibility. Thus, a minimum of 10^6 photons
per bit should reach the detector in the time in which the read-out 
is to be performed. 

At present, simple, reliable lasers operating in the visible range are 
available which emit in excess of 100 mW in continuous operation. In 
the light deflection and the projection onto the photodetectors, a fraction
of the light intensity will be unavoidably lost. In a practical
optical system it will be very difficult to keep these losses below 10 dB.
Accordingly, it can be assumed that 10 mW will be available at the
detectors. If one further assumes that the rate at which bits are to be
read out from any one detector should equal the addressing rate, an
inverse relation between the number of bits that can be paralleled at
one address and the bit rate per detector can be determined. Fig. 1(c)
gives the resulting relation for a photon energy of 1.5 eV and 10^6
photons per bit.

The product between Fig. 1(c) and Fig. 1(a) and (b), respectively,
gives the overall limit of the total bit capacity. Fig. 1(d) gives this 
product for electro-optical deflection while Fig. 1(e) gives it for 
acousto-optical deflection. These curves indicate that for high address- 
ing rates the resulting limit of the total capacity decreases with the 
third power of that rate. 
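
The third-power dependence can be checked with a short calculation. This is a sketch under stated assumptions, not a reproduction of Fig. 1: it uses the 10 mW and 1.5 eV figures from the text, reads the damaged photon-count exponent as 10^6 photons per bit, and models the acousto-optic deflector with a capacity speed product of 10^8 sec^-1 and an optics limit of 10^4 addresses. All function names are hypothetical.

```python
EV_TO_JOULE = 1.602e-19  # joules per electron volt


def bits_per_address(rate_per_s, power_w=10e-3, photon_ev=1.5,
                     photons_per_bit=1e6):
    """Bits that can be read in parallel at one address (light-limited)."""
    photons_per_s = power_w / (photon_ev * EV_TO_JOULE)
    return photons_per_s / (photons_per_bit * rate_per_s)


def addresses(rate_per_s, csp=1e8, optics_limit=1e4):
    """Addresses available from an XY acousto-optic deflector (assumed model)."""
    return min((csp / rate_per_s) ** 2, optics_limit)


def total_bits(rate_per_s):
    return addresses(rate_per_s) * bits_per_address(rate_per_s)


for rate in (1e5, 1e6, 1e7, 1e8):
    print(f"{rate:.0e} addresses/sec -> {total_bits(rate):.2e} bits")
```

Once the address count is set by the capacity speed product rather than by the optics, each tenfold increase in addressing rate lowers the address count by 100 and the bits per address by 10, which gives the factor of 1000 in total capacity.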

The curves permit a general assessment of the potential usefulness 
of optical memories using present state of the art components. Design- 
ing a memory that operates near its physical limits increases the 
complexity and cost; a memory designed for an addressing rate and
capacity which are well removed from these limits can be built much
more readily.

For a memory capacity of 10^8 bits or more, the curves of Fig. 1
indicate that addressing rates of 10^6 addresses per
second are fairly close to the physical limitations. However, if the
concept of paging is used and the number of desired instructions per
address is a number between 1 and 100, depending on the program
design, then a range of addressing rates of 10^4 to 10^6 addresses per
second gives a net rate approaching 10^6 instructions per second.


III. DESIGN CONSIDERATIONS FOR SYSTEM USING AN ACOUSTO-OPTIC 
DEFLECTOR 


A system realizing the above concept utilizes a laser light source and
an XY-deflector which directs the light to any one address in a matrix
of storage elements as depicted in Fig. 2. Each storage element has 
associated with it an optical system through which the stored informa- 
tion is projected onto a common read-out matrix. Thus, upon illumina- 
tion of a selected address a real image of an information matrix is 
generated in the read-out plane. Whether a matrix point receives light 
or not corresponds to a logical “1” or a logical “0”. In the read-out 
plane a matrix of photodiodes or other light sensitive detectors will 
convert the light into electrical signals. Therefore, by illuminating 
Fig. 2— Schematic of optical memory system.


any one storage element with laser light, one transfers a large block 
of information from the storage location to the location of the read-out 
matrix. 

At present, acousto-optic deflection is the most practical method of
light deflection. Specifically, an addressing rate of 10^5 addresses per
second and a total of 10^4 addresses is well within the range of available
technology.2 By storing 10^4 bits per address a total storage capacity of
10^8 bits is possible. Electro-optic light deflectors may ultimately
replace the acousto-optic deflector since, theoretically, the former has
a higher capacity speed product. However, due to materials problems
the electro-optic light deflector is currently limited to speeds no higher
than those achievable with the simpler acousto-optic deflector.

The following more specific considerations are based on the above 
configuration. It will become clear that the design is sufficiently far 
from all inherent limitations so that it will be possible to increase both 
capacity and speed through appropriate refinements within the estab- 
lished technology. 

In the most straightforward approach a real image of the information
matrix is stored in the storage plane. The projection onto the
read-out plane requires a separate lens system associated with each 
storage location, the resolution of which is dictated by the number of 
bits stored at each location. It might be possible that such a multi-lens 
system is realizable with molded plastic fly eye lenses. 

To keep the overall size within reasonable limits, the individual real 
image will be quite small. This would make the system very sensitive 
to dust particles and alignment. 

Since the use of coherent light has already been assumed, it is 
readily possible to use holograms as optical elements. In that case the 
lens and the storage function can be combined by preparing a hologram 
which upon illumination by the deflector generates a real image of an 
information matrix in the read-out plane. Within each hologram, the 
information about each bit is spread over the entire area,”* so that the 
sensitivity to dust particles is significantly reduced. The presence of 
dust in that case gives an overall reduction of the signal-to-noise for 
all bits stored in one hologram. Uniformity of illumination is also not 
very critical. Furthermore, the exact positioning of the individual 
hologram is not very critical. The location of the image, however, will 
depend on the angular orientation of the hologram. With a 100 x 100 
read-out matrix it will be adequate to control the angular positions of 
the individual holograms to one part in 1,000, a precision which is 
quite practical. 

In the hologram case a positional requirement exists between the 
laser and the read-out matrix. It can be expected that the alignment 
of the laser beam with regard to the read-out matrix can be much 
more readily accomplished than the alignment of the picture in the 
case where the information is stored as a real image. In particular, with 
an acoustic deflector that portion of the light which emerges unde- 
flected might be used in a positional servo-mechanism. 

In the hologram case the complications lie primarily in the fabrication
of the holograms. It appears feasible to store the required 10^8 bits
of information on 10^4 individual holograms within an overall size of
2 X 2 inches.

The laser to be used in such a system should emit light of a wave- 
length of less than one micron so that silicon devices can be used as 
photodetectors. It is also desirable that the laser be operable in a 
pulsed mode. 

The best operation would be achieved with a Q-switched solid state 
laser, since, in such lasers, power can be integrated for up to approxi- 
mately 10~* seconds and the energy so stored can be released in pulses 
as short as 10-? seconds.2* Most solid-state detectors will operate with
a minority-carrier lifetime of less than 10-* seconds. In such devices, 
an optimum signal is obtained if the light pulse is short in comparison 
to the lifetime. A Q-switched solid-state laser will thus permit an 
optimum utilization of available laser intensity, since it is capable of 
integrating power up to the longest cycle times of interest and since 
it can discharge the energy in a time shorter than, or at worst com- 
parable to, the lifetime in typical semiconductor devices. 

In the simplest case the read-out matrix will consist of individual 
photodiodes located at each matrix point whereby all diodes can be 
read out simultaneously. For the specific case considered here, this
requires 10^4 diodes and 10^4 read-out circuits. These circuits could be
bi-stable elements such as flip-flops which are set by the photodiode. 

As an alternative to a diode with a flip-flop, a pnpn diode in series 
with a resistor might also be used as a bi-stable element in the read-out 
matrix. The light could switch such a device from the low-current
high-impedance state to a high-current low-impedance state.

The state of the individual bi-stable circuits could be interrogated 
through a multicoordinate address selection. Once the elements are 
set, the interrogation speed is only determined by semiconductor device 
considerations and it should be readily possible to interrogate “words” 
within the “page” with access and cycle times well below 100 nsec. 

In this organization one is really dealing with a small (10^4 bit)
semiconductor memory which is set by the optical system. This small
semiconductor memory may be operated as a buffer store which can
hold the information of one page for any desired period of time.

In other applications it may be necessary to have more than one 
page available in a random access high-speed buffer store. This re- 
quires that all information be rapidly transferred from the read-out 
matrix into a buffer store. In this case, the time for which the read-out 
matrix is to hold the information may become quite short and other 
methods of temporary information storage may be considered such as 
holding the information as a voltage on a capacitor. A particularly 
suitable method might be the use of photo transistors operated in the 
charge storage mode.?? In that mode the light-generated charge is 
stored in a reverse biased junction for times much longer than a life- 
time. The information is interrogated by applying a bias pulse to 
the transistor. 

The charge storage mode has the additional advantage in that it 
can be used to integrate light over fairly long times making it possible 
to use CW lasers such as HeNe lasers with such detectors. This,
however, leads to a somewhat less efficient operation than with a
Q-switched solid-state laser since the CW laser will not integrate
intensity during the time a new address is established.


IV. CONCLUSIONS 


The preceding considerations lead to the conclusion that a
semipermanent optical memory utilizing either acousto-optic or
electro-optic light deflectors shows potential as a memory with 10^8 or more
bits capacity and with page access as short as 1 μsec. Specifically, a
memory with 10^8 bits capacity using acousto-optic deflectors can easily
have page access times of 10 μsec. Such a memory using holograms as
a storage medium and semiconductor devices as read-out elements
appears well within the state of the art. Significantly shorter access
times for similar capacities will probably require electro-optic
deflectors using materials which are not yet available in adequate optical
quality.
Estimation of the Variance of a Stationary
Gaussian Random Process by Periodic Sampling 


By J. C. DALE 


(Manuscript received February 14, 1967) 


I. INTRODUCTION 


This paper* applies previous work (Ref. 1) on estimation of the mean of a
stationary random process by periodic sampling to estimation of the
variance, with the added restriction that the process under consideration
be Gaussian with known mean.

The samples are taken from a sample function of the random process,
in a closed interval (0, T), and are in general correlated. The
estimator used is the average of equally-weighted squared samples. The
variance of this estimator is derived and its behavior is predicted as a
function of the number of samples and length of record.


II. THEORY 


2.1 General 


Let x(t) be a sample function from a stationary, Gaussian random
process {x(t)} with known mean.†
An unbiased estimator of the variance is given by

$$\hat{\sigma}_x^2 = \frac{1}{N+1} \sum_{k=0}^{N} x^2\!\left(\frac{kT}{N}\right) \tag{1}$$

where T/N is the sampling period.
By invoking the Gaussian assumption, the variance of this estimator
follows directly and is given by

$$\operatorname{var}(\hat{\sigma}_x^2) = \frac{2}{N+1} \sum_{k=-N}^{N} \left(1 - \frac{|k|}{N+1}\right) R_x^2\!\left(\frac{kT}{N}\right)$$

* This work was supported by the U.S. Navy, Bureau of Ships under Contract 
N00 600-67-CO549. 

† So long as the mean of {x(t)} is assumed known, no generality of the
derivation is lost by letting it be zero.










1284 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1967 


or in an equivalent form (2) 


a2 2 = 2 — ( Kr) 
j=. : R ir — =} dr. 
Ey SNe 2B Gores r( Ree) 2a Br — ay) a 


$q_{[(N+1)/N]T}(\tau)$ is the triangular weighting function (see Ref. 1) and
$R_x(\tau)$ is the autocorrelation function of {x(t)}.

By comparing $\operatorname{var}(\hat{\sigma}_x^2)$ to (3) in the previous
paper (Ref. 1) it is seen that (2) gives the variance of the sample mean of
a stationary random process {x²(t)}, whose autocovariance function is given
by $2R_x^2(\tau)$. The spectrum of {x²(t)} is $2S_x(\omega) * S_x(\omega)$.
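
The statement that {x²(t)} has autocovariance 2R_x²(τ) can be used to evaluate the variance of the estimator directly. The sketch below assumes N + 1 equally weighted samples at spacing T/N (both end points included) and a zero-mean Gaussian process. It computes the variance twice: as a double sum of the covariances 2R_x² over all sample pairs, and as a single sum with triangular weights 1 − |k|/(N + 1). The exponential autocorrelation and the function names are hypothetical, chosen only for illustration.

```python
import math


def var_single_sum(R, N, T):
    """(2/(N+1)) * sum over k = -N..N of (1 - |k|/(N+1)) * R(kT/N)**2."""
    n = N + 1  # number of samples, both end points included
    return (2.0 / n) * sum(
        (1.0 - abs(k) / n) * R(k * T / N) ** 2 for k in range(-N, N + 1)
    )


def var_double_sum(R, N, T):
    """(1/(N+1)**2) * sum over sample pairs of cov(x_j^2, x_k^2) = 2 R**2."""
    n = N + 1
    return sum(
        2.0 * R((j - k) * T / N) ** 2 for j in range(n) for k in range(n)
    ) / n ** 2


def example_autocorrelation(tau):
    # Hypothetical unit-variance autocorrelation, exp(-|tau|).
    return math.exp(-abs(tau))


print(var_single_sum(example_autocorrelation, N=8, T=4.0))
print(var_double_sum(example_autocorrelation, N=8, T=4.0))
```

The two forms agree because the pair (j, k) enters only through j − k, and there are N + 1 − |j − k| pairs for each lag.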

At this point the previous theory (Ref. 1) applies directly. Using the same
notation, the spectrum of the squared samples can be written as 


2 ? 2 —jwr : kT | 
G@) = rane Qweiswy r(7)R2(r)e ys i(; oe ) on 


k=—@ 


which is equivalent to (3) 


ate) = Fle — a), 
In this case 


FW) = Q@) * 2[8.@) * S.@)], (4) 
and Q(w) is the transform of the weighting function. 

G(w) can be interpreted as F'(w), shifted by integral multiples of the 
sampling frequency, 2xN/7. As before to obtain the variance of the 
estimate we need only be concerned with the value of G(w) at wo = 0. 
To minimize the variance of the estimate, the sampling frequency 
should be high enough to prevent overlapping of the sideband at w = 0. 
Satisfying this condition results in 


var (2) = 7 FO).4 (5) 


To answer the question of how many samples to take in time T to
obtain minimum variance, consider (4). Q(ω) is approximately zero for
|ω| ≥ (2π/T)[N/(N + 1)]. If S_x(ω) is zero for |ω| ≥ 2πB, then
S_x(ω) * S_x(ω) will be zero for |ω| ≥ 4πB and F(ω) will be approximately
0 for

$$|\omega| \ge 2\pi\left[2B + \frac{1}{T}\left(\frac{N}{N+1}\right)\right]. \tag{6}$$

Therefore, choosing the sampling frequency so that

$$\frac{2\pi N}{T} = 2\pi\left[2B + \frac{1}{T}\left(\frac{N}{N+1}\right)\right] \tag{7}$$

results in (5) being satisfied.
Solving (7) for N yields the required number of samples taken in
time T to approximately minimize the variance of the estimate, namely

$$N = 2BT\left[\frac{1 + \sqrt{1 + 2/BT}}{2}\right]. \tag{8}$$


‡ Equation (5) is not quite true when both end points of the time record are
included as samples. This is because (N/T)F(0) is a function of N, namely,

$$\frac{N}{T}\,F(0) = \frac{1}{2\pi}\int_{-\infty}^{\infty} 2[S_x(y) * S_x(y)]
 \left[\frac{\sin y(N+1)T/2N}{y(N+1)T/2N}\right]^2 dy.$$

Increasing N beyond the value given in (8) actually results in a higher
variance on the estimate. This is apparent in the two examples, particularly
for T small. This same effect was discussed in Ref. 1.


For BT ≫ 1, N is approximately equal to 2BT. Thus, twice as many
samples are required to obtain a minimum variance estimate
of the variance as were previously shown to be required for a minimum
variance estimate of the mean.
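
As a numerical illustration of (8), the sketch below evaluates N for a few values of BT and shows the approach to 2BT. It assumes the form N = BT[1 + (1 + 2/BT)^(1/2)], which is the positive root of N² = 2BT·N + 2BT obtained by clearing fractions in (7). The function name is hypothetical.

```python
import math


def intervals_for_min_variance(B, T):
    """N from (8); the record [0, T] then holds N + 1 samples."""
    bt = B * T
    return bt * (1.0 + math.sqrt(1.0 + 2.0 / bt))


for bt in (1.0, 5.0, 50.0):
    n = intervals_for_min_variance(1.0, bt)
    print(f"BT = {bt:5.1f}: N = {n:7.2f}, N / (2BT) = {n / (2.0 * bt):.3f}")
```

In practice N would be rounded to an integer. For short records the ratio N/(2BT) is noticeably above 1, which reflects the end-point effect described in the footnote to (5).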


2.2 Variance For Large T 


If T is allowed to become large Q(w) will approach a delta function, 





limit Q@) = ve - 6). (9) 
This results in 
Pe) = apy |, S80 — ay. (10) 


If S_x(ω) is zero for |ω| ≥ 2πB, and the sampling frequency satisfies
(7), then the minimum value of variance of the estimate is given by


var #2) lan = FPO =F; | Oa aD 


T large 


~ tf 2 
OS el [- S:(y) dy. 


This is the same value obtained by continuous sampling. 


III. EXAMPLES 


The variance of the estimate of variance as a function of number of 
samples ($N+1$) and length of record ($T$) has been computed for two 
examples. 

The computation was done using an expression equivalent to (2). 


3.1 Rectangular Spectrum 


2; —2r <w < 2r 
— <2) 5) 
S.() : ; elsewhere. (12) 


Fig. 1 shows $\operatorname{var}(\hat{\sigma}^2)$ plotted against number of samples. Each curve 
represents a different length of record as indicated by the values shown 
on the figure. It should be noted that the minimum value of $\operatorname{var}(\hat{\sigma}^2)$ 
occurs at the number of samples predicted by (8). Also, for small values 
of $T$, $\operatorname{var}(\hat{\sigma}^2)$ reaches a minimum and then increases as more samples 
are taken. This is due to including both end points of the time record 
as samples. 

Fig. 1. Variance of the sample variance as a function of the number of samples 
and length of record for a process with rectangular spectral density. 

Fig. 2. Variance of the sample variance as a function of the number of samples 
and length of record for a process with Markoff spectral density. 


3.2 Markoff Spectrum 

2 . 
This example shows a nonbandlimited spectrum. The results are shown 
in Fig. 2. 





S.(@@) = re (13) 


IV. CONCLUSION 


By making the assumption that the random process $\{x(t)\}$ was 
Gaussian, it was possible to express $\operatorname{var}(\hat{\sigma}^2)$ as an array of terms 
containing $R_x^2(kT/N)$, (2). In this form it is possible to apply the theory 
developed in the work on estimation of the mean.¹ The interesting result 
from this derivation was that when $BT \gg 1$, the variance of the sample 
variance is essentially minimized when $2BT$ samples are taken. This 
is in contrast to the $BT$ samples required to minimize the variance of 
the sample mean. 


A Floating Gate and Its Application 
to Memory Devices 


By D. KAHNG and S. M. SZE 


(Manuscript received May 16, 1967) 


A structure has been proposed and fabricated in which semipermanent 
charge storage is possible. A floating gate is placed a small 
distance from an electron source. When an appropriately high field 
is applied through an outer gate, the floating gate charges up. The 
charges remain stored even after the charging field is removed, because 
the probability of back transport is much lower. A stored-charge density of 
the order of 10¹²/cm² has been achieved and detected by a structure 
similar to a metal-insulator-semiconductor (MIS) field effect transistor. 
Such a device functions as a bistable memory with nondestructive 
read-out features. The memory holding time observed was longer than 
one hour. These preliminary results are in fair agreement with a simple 
analysis. 

It has been recognized for some time that a field-effect device, such 
as that described by Shockley and Pearson,¹ can be made bistable 
by utilizing switchable permanent displacement charges on ferroelectric 
material.² Subsequent studies of ferroelectric material have revealed,’ 
however, that the inherent speed capability of a device incorporating 
a ferroelectric material is limited by domain motion, whose highest 
speed is limited by the acoustic velocity. In the absence of highly 
ordered, near-ideal thin film ferroelectric material, the speed capability 
of a bistable device, therefore, is in the microsecond range at best.* In 
addition, many ferroelectric materials suffer from irreversible mechanical 
disorder after many cycles of polarization switching,? which leaves 
the long-term reliability of such a device uncertain. 

An alternative to a ferroelectric gate is a floating gate chargeable by 
field emission, which may circumvent the above-mentioned difficulties. 
Consider a sandwich structure: metal M(1), insulator I(1), 
metal M(2), insulator I(2), and finally metal M(3) (see Fig. 1). If 
the thickness of I(1) is small enough that a field-controlled electron 
transport mechanism such as tunneling or internal tunnel-hopping 
is possible, a positive bias on M(3) with respect to M(1), with M(2) 
floating [M(2) is called the floating gate henceforth], would cause 
electron accumulation in the floating gate, provided electron transport 
across I(2) is small. These conditions can be met by choosing I(1) 
and I(2) such that the ratio of dielectric permittivities $\varepsilon_1/\varepsilon_2$ is small 
and/or the barrier height into I(1) is smaller than that into I(2). The 
sandwich structure is somewhat similar to the tunnel emitter metal-base 
transistor proposed by Mead? but with the following 
essential differences. 


(i) M(2) is much thicker than the hot electron range, so that 
emitted electrons are close to the Fermi level of M(2) before 
reaching I(2). 

(ii) No carrier transport is allowed across I(2). 

(iii) M(2) is floating. 


Fig. 1. Energy band diagram of a floating gate structure with a semiconductor-insulator-metal-insulator-metal 
sandwich. For calculation of the stored charge, the 
semiconductor is replaced by a metal M(1). (a) When a positive voltage step is 
applied to the outer gate. (b) When the voltage is removed. The stored charge 
Q causes an inversion of the semiconductor surface. 


The stored charge $Q$, as a function of time when a step voltage function 
with amplitude $V$ is applied across the sandwich, is given by 

$$Q(t) = \int_0^t j \, dt' \quad \text{coul/cm}^2. \qquad (1)$$


When the emission is of Fowler-Nordheim tunneling type, then the 
current density, $j$, has the form 

$$j = C_1 E^2 \exp(-E_0/E), \qquad (2a)$$

where $C_1$ and $E_0$ are constants in terms of effective mass and the barrier 
height. (We have neglected the effects due to the image force lowering® 
of the barrier, etc., but the essential feature is expected to be retained 
even after detailed corrections are made.) This type of current transport 
occurs in SiO₂ and Al₂O₃. 

When the field emission is of the internal Schottky or Frenkel-Poole 
type, as occurs in Si₃N₄,’ then $j$ follows the form 

$$j = C_2 E \exp\left[-q\left(\Phi_1 - \sqrt{qE/\pi\varepsilon_d}\right)/kT\right], \qquad (2b)$$

where $C_2$ is a constant in terms of trapping density in the insulator, $\Phi_1$ 
the barrier height in volts, and $\varepsilon_d$ the dynamic permittivity. 

The electric field in I(1) at all times is a function of the applied 
voltage $V$ and $Q(t)$, and is obtainable from the displacement continuity 
requirement as 

$$E = \frac{V}{d_1 + d_2(\varepsilon_1/\varepsilon_2)} - \frac{Q}{\varepsilon_1 + \varepsilon_2(d_1/d_2)}, \qquad (3)$$

where $d_1$ and $d_2$ are the thicknesses of I(1) and I(2), respectively. 
Fig. 2(a) shows the results of a theoretical computation using (1), 
(2a), and (3) with the following parameters: $d_1$ = 50 Å, $\varepsilon_1 = 3.8\,\varepsilon_0$ (for 
SiO₂), $d_2$ = 1000 Å, $\varepsilon_2 = 30\,\varepsilon_0$ (for ZrO₂), and $V$ = 50 volts. One notes 
that the stored charge initially increases linearly with time and then 
saturates. The current is almost constant for a short time and then 
decreases rapidly. The field in I(1) decreases slightly as the time 
increases. The above results can be explained as follows: When a 
voltage pulse is applied at $t = 0$, the initial charge $Q$ is zero, and the 
initial electric field across I(1) has its maximum value, $E_{\max} = 
V/[d_1 + (\varepsilon_1/\varepsilon_2)d_2]$. As $t$ increases, $Q$ will first increase linearly with 
time. This is because, for $Q$ small enough that $E$ remains 
essentially the same, the current will in turn remain the same, so 
$Q \approx j(E_{\max}) \cdot t$. Eventually, when $Q$ is large enough to reduce the 
value of $E$ substantially, the current will decrease rapidly with 
time and $Q$ will increase slowly. 

Fig. 2. (a) Theoretical results of the stored-charge density (Q), the current 
density (J), and the electric field across I(1) as a function of time. V = 50 volts, 
d₁ = 50 Å, ε₁/ε₀ = 3.8 (for SiO₂), d₂ = 1000 Å, ε₂/ε₀ = 30 (for ZrO₂). (b) Theoretical 
results of the stored-charge density as a function of time with 
the same ε₁ and ε₂ as in (a), and d₁ = 10 Å, d₂ = 100 Å (solid lines), 
d₁ = 30 Å, d₂ = 300 Å (dotted lines). (c) Theoretical results of the stored-charge density 
as a function of time with d₁ = 20 Å, ε₁/ε₀ = 60 (for Si₃N₄), d₂ = 200 Å, 
ε₂/ε₀ = 30 (for ZrO₂) and various applied voltages. 
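
The charging behavior described above (linear rise, then saturation) can be reproduced with a short numerical integration of (1), (2a), and (3). The sketch below is illustrative only. The geometry and permittivities are those quoted in the text for Fig. 2(a), but the Fowler-Nordheim constants `c1` and `e0` are assumed values chosen for this example and are not taken from the paper, so the absolute time scale should not be compared with Fig. 2. The function names and the simple forward-Euler stepping are likewise assumptions of the sketch.

```python
import math

EPS0 = 8.854e-14  # permittivity of free space, F/cm

def field_in_i1(v, q, d1, d2, eps1, eps2):
    """Equation (3): field in I(1), V/cm, for applied voltage v and stored charge q (C/cm^2)."""
    return v / (d1 + d2 * eps1 / eps2) - q / (eps1 + eps2 * d1 / d2)

def fn_current(e_field, c1, e0):
    """Equation (2a): Fowler-Nordheim current density, A/cm^2."""
    if e_field <= 0.0:
        return 0.0
    return c1 * e_field ** 2 * math.exp(-e0 / e_field)

def stored_charge(t_end, v, d1, d2, eps1, eps2, c1, e0, rel_step=1e-3):
    """Integrate (1) by forward Euler; each step adds at most rel_step of the limiting charge."""
    # Charge that would reduce the field in I(1) to zero; Q can never exceed it.
    q_limit = field_in_i1(v, 0.0, d1, d2, eps1, eps2) * (eps1 + eps2 * d1 / d2)
    q, t = 0.0, 0.0
    while t < t_end:
        j = fn_current(field_in_i1(v, q, d1, d2, eps1, eps2), c1, e0)
        if j <= 0.0:
            break
        dt = min(t_end - t, rel_step * q_limit / j)
        q += j * dt
        t += dt
    return q

# Lengths in cm. c1 (A/V^2) and e0 (V/cm) are ASSUMED illustrative constants.
PARAMS = dict(v=50.0, d1=50e-8, d2=1000e-8, eps1=3.8 * EPS0, eps2=30.0 * EPS0,
              c1=3.85e-7, e0=3.45e8)

if __name__ == "__main__":
    for t in (1e-12, 1e-10, 1e-8, 1e-6):
        q = stored_charge(t, **PARAMS)
        print(f"t = {t:.0e} s   Q = {q:.3e} C/cm^2   ({q / 1.602e-19:.2e} charges/cm^2)")
```

With any reasonable choice of constants the printed charge grows in proportion to time at first and then flattens as the field in I(1) falls, which is the qualitative behavior the text attributes to Fig. 2(a).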

Fig. 2(b) shows the stored charge as a function of time for the same 
$\varepsilon_1$ and $\varepsilon_2$ but different $d_1$, $d_2$, and $V$. It is clear that for a given structure, 
in order to store a given amount of charge, one can either increase 
the applied voltage or increase the charging time (pulse width) or both. 
Fig. 2(c) shows the calculated stored charge for the current transport 
described by (2b). Here I(1) is a 20 Å thick Si₃N₄ film. There are 
marked decreases in the gate voltages required for a given charge 
compared to Al₂O₃ or SiO₂. This is largely due to the much lower barrier 
height (1.3 volts)? compared to SiO₂ (≈ 4.0 volts).® 

It is noted that the field in I(1) for appreciable charge storage is in 
the 10⁷ V/cm range. When the outer gate voltage is removed, the field 
in I(1) due to the stored charge on the inner gate is only 10⁶ V/cm 
or so, corresponding to 5×10¹² charges/cm², a large enough charge to 
detect easily. Since the transport across I(1) is highly sensitive to the 
field, see (2a) and (2b), no charges flow back. The charge loss is actually 
controlled by the dielectric relaxation time of the sandwich structure,® 
which is very long. When it is desired to discharge the floating gate 
quickly, it is necessary to apply to the outer gate a voltage about equal 
in magnitude but opposite in polarity to the voltage which was used 
for charging. It is evident that net positive charges (loss of electrons) 
can also be stored in the floating gate if the discharging gate voltages 
are appropriately chosen in magnitude and duration. 
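
The two field magnitudes quoted in this paragraph follow directly from (3). The sketch below evaluates (3) for the SiO₂/ZrO₂ parameters given for Fig. 2(a), once with the 50-volt charging pulse and no stored charge, and once with the pulse removed and 5×10¹² electronic charges/cm² on the floating gate. The function name and unit choices are assumptions of this example.

```python
Q_ELECTRON = 1.602e-19  # electronic charge, C
EPS0 = 8.854e-14        # permittivity of free space, F/cm

def field_in_i1(v, charges_per_cm2, d1_cm, d2_cm, eps1_rel, eps2_rel):
    """Equation (3), with Q given as a number density of electronic charges."""
    q = Q_ELECTRON * charges_per_cm2
    applied = v / (d1_cm + d2_cm * eps1_rel / eps2_rel)
    stored = q / (EPS0 * (eps1_rel + eps2_rel * d1_cm / d2_cm))
    return applied - stored

# d1 = 50 A of SiO2, d2 = 1000 A of ZrO2, as quoted for Fig. 2(a)
GEOM = dict(d1_cm=50e-8, d2_cm=1000e-8, eps1_rel=3.8, eps2_rel=30.0)

charging_field = field_in_i1(50.0, 0.0, **GEOM)   # pulse on, gate still uncharged
retained_field = field_in_i1(0.0, 5e12, **GEOM)   # pulse removed, charge stored

if __name__ == "__main__":
    print(f"field during charging: {charging_field:.2e} V/cm")
    print(f"field after removal:   {retained_field:.2e} V/cm")
```

The negative sign of the second result only indicates that the retained field opposes the charging field; its magnitude is more than an order of magnitude smaller, which is why back transport is negligible.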

It was mentioned that the stored-charge density of 5×10¹²/cm² 
was sufficient for easy detection. One of the detection or read-out 
schemes is to use the surface field effect transistor (MOSFET or 
IGFET) first fabricated and described by Kahng and Atalla¹⁰ in 1960. 
For inversion at a silicon surface, the charge required is only about 
2×10¹¹/cm² for 1 ohm-cm n-type silicon. However, surface-state 
charges at the silicon-silicon dioxide interface may be as high as 
10¹²/cm², depending on the fabrication techniques used. For this reason 
we have chosen 5×10¹²/cm² as the stored charge required for easy 
detection. When the Insulated Gate Field Effect Transistor (IGFET) 
principle is used for read-out, M(1) is replaced by silicon. This 
requires a slight correction in the calculation of charge flow through 
the insulator, but the major features of the results are not expected to 
be altered significantly. It is to be noted that only about one half of the 
stored charge can be active in creation of the inversion layer since 
the other half resides near the M(2)-I(2) interface due to Coulombic 
repulsion. 

To check the feasibility, a floating gate device has been fabricated 
using an IGFET as shown in Fig. 3(a). The substrate is n-type 
silicon, 1 ohm-cm, and (111) oriented. I(1) is 50 Å of SiO₂ thermally 
grown in a dry oxygen furnace. M(2) and I(2) are Zr (1000 Å) and 
ZrO₂ (1000 Å), respectively. M(3) and the ohmic contact metals are 
aluminum deposited in a vacuum system. Fig. 3(b) is another version 
of the floating gate device using a thin film transistor (TFT) structure.1* 

Fig. 3. (a) Schematic diagram of a floating gate device using an IGFET. The 
numbers indicated correspond to those shown in Fig. 1. (b) Schematic diagram 
of a floating gate device using thin film transistor structure. 


The IGFET-type floating gate devices have been tested in a pulsing 
circuit. Because of the relatively thick insulator layers, large voltages 
(≈ 50 V) and long pulse widths (~ 0.5 μs) have to be applied in order 
to store ~ 5×10¹² charges/cm². Fig. 4 shows the experimental results. 
A positive pulse of 50 volts is first applied to the gate electrode, and 
60 ms later a negative pulse of 50 volts is applied. Then the pulsing 
cycle repeats. In Fig. 4(a) the pulse widths are 0.5 μs. One notes that 
when the positive pulse is applied, a sufficient amount of charge is 
stored in the floating gate so that the silicon surface is inverted; a 
conducting channel is thus formed, and the channel current is “on.” It 
can be seen that the channel current decreases only slightly at the end 
of 60 ms. When the negative pulse is applied, the stored charge is 
eliminated, and with it the channel. The channel current reduces to its 
“off” state. Fig. 4(b) shows results for pulses with the same widths 
but smaller amplitude (40 V). Since the stored charge is a strong 
function of the pulse amplitude, only a very small amount of charge is 
stored, too small to cause inversion. For non-leaky units, memory 
holding times longer than one hour have been observed. 

Fig. 4. Experimental results of the channel current of an IGFET-type floating 
gate device. A positive voltage pulse, V₁, with pulse width W₁, is first applied to 
the gate, and 60 ms later a negative pulse V₂ with pulse width W₂ is applied. Then 
the pulsing cycle repeats. Horizontal scale: 20 ms/div. Vertical scale: 0.1 mA/div. 
(a) V₁ = V₂ = 50 volts, W₁ = W₂ = 0.5 μs. (b) V₁ = V₂ = 40 volts, W₁ = W₂ = 0.5 μs. 

It is clear that a modified IGFET such as a TFT can be used for 
read-out, as shown in Fig. 3(b). For an academic study of device opera- 
tion, the floating gate can be partially exposed and a potential probe 
can be placed nearby. 

In conclusion, it has been demonstrated that controlled field 
emission to the buried “floating” gate may be capacitively induced by 
pulsing the outer gate electrode. This combination can therefore be 
used as a memory device, with holding time as long as the dielectric 
relaxation time of the gate structure and with continuous nondestructive 
read-out capability. There seems to be no inherent reason why 
read-in and read-out cannot be performed in a very short time, say in the 
nanosecond range or even shorter. 

We wish to acknowledge helpful discussions and technical assistance 
rendered by M. P. Lepselter and P. A. Byrnes in connection with ZrO₂, 
and skillful technical assistance given to us by G. P. Carey, 
A. Loya, and J. F. Grandner and his group. 


Semipermanent Memory Using Capacitor Charge 
Storage and IGFET Read-out 


By D. KAHNG 


(Manuscript received May 17, 1967) 


One of the earliest computers used capacitors as its memory.¹ A 
mechanical means was used for both read-in and read-out operations. 
Electronic accessing was used in conjunction with vacuum tube or 
solid-state diodes in relatively modern computers such as the SEAC 
computer.² Capacitor storage is rarely used at present since magnetic 
memories meet the modern computer requirements much better. The 
inherent difficulty with capacitor storage was the limited holding time 
since a nonlinear resistor with small enough leakage currents to allow 
useful memory holding time was then not readily available. The old 
capacitor memory was charged through a diode with slow recovery 
time and with leakage current of 10° amp at best and required a 
large capacitor for any appreciable holding time. Furthermore, the 
read-out was usually destructive. 

The capacitor storage merits re-examination in view of the advanced 
solid-state devices and technology now available. Coupled with an 
Insulated Gate Field Effect Transistor (IGFET),³ the read-out can 
be nondestructive. Integrated circuit techniques may prove superior 
to the current magnetic memories for some applications where 
infrequent recycling is permissible. The inherent speed should be much 
faster than that of magnetic units. 

Consider a capacitor $C$ in series with a nonlinear element as shown 
in Fig. 1. The capacitor may represent the gate capacity of the IGFET 
plus any external capacitor in parallel with the gate capacitor. When a 
positive voltage pulse with amplitude $V$ and duration $\tau$ is applied at 
the nonlinear element terminal, the stored charge 
$Q(\tau)$ and the decay time constant $\tau_d$, defined as the time required to 
reach $1/e$ of the initial stored charge $Q_0$, can be calculated for 
various nonlinear resistors. 


I. POWER-LAW RESISTORS 
The I-V characteristics are given by 

$$I = KV^m, \qquad (1)$$

then 

$$Q(\tau) = C\left\{V - \left[\frac{1}{V^{m-1}} + K(m-1)\frac{\tau}{C}\right]^{-1/(m-1)}\right\} \qquad (2)$$

and 

$$\tau_d = \frac{C^m}{K(m-1)}\,\frac{e^{m-1} - 1}{Q_0^{\,m-1}} = \frac{(e^{m-1} - 1)}{(m-1)}\,\frac{Q_0}{I_0}, \qquad (3)$$

where $I_0$ is the discharge current at the termination of charging. It is 
clear from (2) that $m \ge 1$ for physically meaningful $Q(\tau)$. For a long 
holding time, (3) tells us that the nonlinearity of the resistor should 
be large. These equations would describe the behavior of the storage 
unit comprising a space-charge-limited-current diode.* If traps are 
present, only a simple modification is needed in the analysis. Structures 
comprising photosensitive space-charge-limited-current diodes such as 
CdS diodes should allow optical read-in operations which might be 
advantageous for certain applications such as a vidicon. 
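
The closed forms for the power-law resistor can be checked against a direct numerical integration. The sketch below is a hedged illustration: it takes the stored-charge and decay-time expressions in the form $Q(\tau) = C\{V - [V^{-(m-1)} + K(m-1)\tau/C]^{-1/(m-1)}\}$ and $\tau_d = [(e^{m-1}-1)/(m-1)]\,Q_0/I_0$, assumes that the capacitor discharges through the same power law $I = KV^m$, and uses component values (`C`, `K`, `m`, pulse amplitude and width) that are arbitrary choices for the example rather than values from the paper.

```python
import math

def charge_after_pulse(v, tau, c, k, m):
    """Stored charge after a pulse of amplitude v and duration tau (power-law resistor, m > 1)."""
    bracket = v ** (-(m - 1)) + k * (m - 1) * tau / c
    return c * (v - bracket ** (-1.0 / (m - 1)))

def decay_time(q0, c, k, m):
    """Time for the stored charge to fall from q0 to q0/e, closed form."""
    i0 = k * (q0 / c) ** m  # discharge current at the end of charging
    return (math.exp(m - 1) - 1.0) / (m - 1) * q0 / i0

def decay_time_numeric(q0, c, k, m, steps=20000):
    """Midpoint-rule integral of dt = dQ / (K (Q/C)^m) between q0/e and q0."""
    lo, hi = q0 / math.e, q0
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        q = lo + (i + 0.5) * h
        total += h / (k * (q / c) ** m)
    return total

if __name__ == "__main__":
    C, K, M = 1e-12, 1e-9, 3.0  # ASSUMED values: farads, A/V^m, exponent
    q0 = charge_after_pulse(10.0, 1e-6, C, K, M)
    print(f"Q0 = {q0:.3e} C")
    print(f"decay time, closed form: {decay_time(q0, C, K, M):.4e} s")
    print(f"decay time, numerical:   {decay_time_numeric(q0, C, K, M):.4e} s")
```

Increasing `M` while holding `q0` and the initial discharge current fixed lengthens the decay time, which is the sense in which a strongly nonlinear resistor favors a long holding time.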


Il, TUNNEL SANDWICH DIODES 


For this nonlinear element, the circuit in Fig. 1 should be modified 
to include the shunt capacitance of the tunnel sandwich. Thus, at the 
instant of pulse application, the voltage divides between the two 
capacitors. The I-V relationship for Fowler-Nordheim type tunneling 
is 

$$I = KV^2 e^{-V_0/V}. \qquad (4)$$

With the appropriate initial conditions, the stored charge $Q(\tau)$ is given 
by 

$$Q(\tau) = C\left[V - \alpha V_0 \Big/ \ln\left(e^{\alpha V_0/V} + \frac{KV_0}{\alpha C}\,\tau\right)\right], \qquad (5)$$

where $\alpha$ is defined as unity plus the ratio of the shunt capacitance of 
the tunnel sandwich, $C_s$, to the charging capacitor, namely $\alpha = 1 + 
(C_s/C)$. 

Fig. 1. A capacitor being charged through a nonlinear resistor. 


The discharge time constant 7, can also be shown to be 


aC (eee nee” =a 


_ KY, 





ce" CVo/Qo 


(6) 


= (aC Vo/Qv) (e-2) 
TL . 


It is clear that the charging time required for adequately large $Q$ is very 
short, allowing fast read-in operation. The decay time can be seen to 
be large for tunneling across well-known insulators such as SiO₂ and 
Al₂O₃. Therefore, the decay is not controlled by (6) but rather by the 
dielectric relaxation time of the insulators used for the entire assembly 
including the IGFET gate material. The dielectric relaxation time of 
the best inorganic insulators is of the order of one day at room 
temperature. However, certain organic insulators are known to have 
theoretical dielectric relaxation times of many years. Performance of 
a memory cell incorporating a tunnel sandwich diode is described in 
more detail elsewhere.® 


III. SCHOTTKY BARRIER DIODES 


For charging through a rectifier, Schottky barrier diodes are pre- 
ferred over pn junction diodes since Schottky barriers are majority 
carrier devices and hence fast recovery is achievable.® I-V charac- 
teristics may be represented by 

I = 1,(e" — 1). (7) 
For charging, we may neglect the unity in the bracket in (7), and the 
stored charge can be shown to be 


Q(r) = cv ~ ain (= aii em) | (8) 


where 





For decay, it is easy to show 


pfs i ety (9) 
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Fig. 2 shows the stored charge Q computed from (8) as a function 
of pulse duration for several pulse amplitudes. The characteristic time 
constant 7, is not much less than 1 sec for a typical configuration 
(C < 10° Fd, J, > 1072 amp). Therefore, it is seen that the stored 
charge is proportional to the pulse amplitude for sufficiently large V. 
This suggests that the device may be used as a multi-level storage 
unit. 

The combination of Schottky barrier diodes and the IGFET is 
shown in Fig. 3. A similar structure has been fabricated and tested. 
The holding time was of the order of 10 sec when the charging was 
done by a 15-volt pulse, in agreement with (9), since the IGFET gate 
capacitance was about 10⁻¹² Fd and $I_s$ of the diode was 10⁻¹² amp 
(measured at 10 volts reverse bias). The maximum pulse amplitude 
before the breakdown of the gate insulator was about 30 volts, allowing 
a longer holding time of about 30 seconds. The read-in time was 
less than 10-7 second and the reverse breakdown of the diode was used 
to turn the IGFET off, which also took less than 10-‘ second. 
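
As a rough plausibility check on the quoted holding times (and not a substitute for the decay expression (9)), one can assume that the stored charge $CV$ simply leaks away through a constant reverse current. The sketch below makes that assumption explicit and takes both the gate capacitance and the diode reverse current to be of order 10⁻¹² (farads and amperes respectively); the function name and these round values are assumptions of the example.

```python
def holding_time_estimate(capacitance_f: float, pulse_v: float, leakage_a: float) -> float:
    """Crude estimate: time for a constant leakage current to remove the charge C*V."""
    return capacitance_f * pulse_v / leakage_a

if __name__ == "__main__":
    for pulse_v in (15.0, 30.0):
        t_hold = holding_time_estimate(1e-12, pulse_v, 1e-12)
        print(f"{pulse_v:4.0f} V pulse -> holding time of order {t_hold:.0f} s")
```

Under this constant-leakage assumption the estimate scales linearly with the pulse amplitude, consistent in order of magnitude with the holding times of about 10 and 30 seconds reported for 15-volt and 30-volt pulses.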

A memory featuring nondestructive read-out, access times in the 
submicrosecond range, and holding times of many seconds should find 
many applications. An integrated structure incorporating Schottky barrier 
diodes and IGFETs is readily obtainable with modern solid-state 
technology. 

Fig. 2. Theoretical stored charge/storing capacitance (the floating potential) 
as a function of time for charging through a Schottky barrier diode. 

Fig. 3. Combination of the capacitor storage unit with a p-channel IGFET 
for read-out. (a) Series connection, read-in should be positive. (b) Parallel 
connection, read-in should be negative. 